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ENGINEERING INTRACELLULAR SIALYLATION PATHWAYS 

[0100] This application claims benefit under 35 U.S.C. section 1 19(e) based on 
copending U.S. Provisional Application Serial No. 60/227,579, filed August 25, 2000, 
which is herein incorporated by reference in its entirety, and also claims benefit under 
35 U.S.C. section 120 to U.S. Application Serial No. 09/516,793, filed March 1, 2000, 
which is herein incorporated by reference in its entirety and which claims benefit under 
35 U.S.C. section 1 19(e) based on U.S. Provisional Applications Nos. 60/169,624, filed 
December 8, 1999, and 60/122,582, filed March 2, 1999, both of which are herein 
incorporated by reference in their entireties. 

[0101] Part of the work performed during the development of this invention 
utilized U. S. Government funds in the form of grants from the National Science 
Foundation, Grant Numbers BES9814157, BES9814100, and the National 
Institutes of Health, Grant Number ROl-GM-49734. The U.S. Government has 
certain rights in this invention. 

FIELD OF THE INVENTION 
[0102] The invention relates to methods and compositions for expressing sialylated 
glycoproteins in heterologous expression systems, particularly insect cells. 

BACKGROUND OF THE INVENTION 
[0103] While heterologous proteins are generally identical at the amino acid level, 
their post-translationally attached carbohydrate moieties often differ from the 
carbohydrate moieties found on proteins expressed in their natural host species. Thus, 
carbohydrate processing is specific and limiting in a wide variety of organisms 
including insect, yeast, mammalian, and plant cells. 

[0104] The baculovirus expression vector has promoted the use of insect cells as 
hosts for the production of heterologous proteins (Luckow et al. (1993) Curr. Opin. 
Biotech. 4:564-572, Luckow et al. (1995) Protein production and processing from 
baculovirus expression vectors). Commercially available cassettes allow rapid 
generation of recombinant baculovirus vectors containing foreign genes under the 
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control of the strong, polyhedrin promoter. This expression system is often used to 
produce heterologous secreted and membrane-bound glycoproteins normally of 
mammalian origin. 

[0105] However, post-translational processing events in the secretory apparatus of 
insect cells yield glycoproteins with covalently-linked oligosaccharide attachments that 
differ significantly from those produced by mammalian cells. While mammalian cells 
often generate complex oligosaccharides terminating in sialic acid (SA), insect cells 
typically produce truncated (paucimannosidic) and hybrid structures terminating in 
mannose (Man) or N-acetylglucosamine (GlcNAc) (Figure 1). The inability of insect 
cell lines to generate complex carbohydrates comprising sialic acid significantly limits 
the wider application of this expression system. 

[0106] The carbohydrate composition of an attached oligosaccharide, especially 
sialic acid, can affect a glycoprotein's solubility, structural stability, resistance to 
protease degradation, biological activity, and in vivo circulation (Goochee et al. (1991) 
Bio/technology 9:1347-1355, Cumming et al (1991) Glycobiology 1:115-130, 
Opdenakker et al (1993) FASEB J. 7:1330, Rademacher et al (1988) Ann. Rev. 
Biochem., Lis et al (1993) Eur. J. Biochem. 218:1-27). The terminal residues of a 
carbohydrate are particularly important for therapeutic proteins since the final sugar 
moiety often controls its in vivo circulatory half-life (Cumming et al (1991) 
Glycobiology 1 :1 15-130). Glycoproteins with oligosaccharides terminating in sialic 
acid typically remain in circulation longer due to the presence of receptors in 
hepatocytes and macrophages that bind and rapidly remove structures terminating in 
mannose (Man), N-acetylglucosamine (GlcNAc), and galactose (Gal), from the 
bloodstream (Ashwell et al (1974) Giochem. Soc. Symp. 40:1 17-124, Goochee et al 
(1991) Bio/technology 9:1347-1355, Opdenakker^ al (1993) FASEB J. 7:1330). 
Unfortunately, Man and GlcNAc are the residues most commonly found on the termini 
of glycoproteins produced by insect cells. The presence of sialic acid can also be 
important to the structure and function of a glycoprotein since sialic acid is one of the 
few sugars that is charged at physiological pH. The sialic acid residue is often involved 
in biological recognition events such as protein targeting, viral infection, cell adhesion, 
tissue targeting, and tissue organization (Brandley et al. (1986) J. of Leukocyte bio. 
40:97-111, Varkie/ al (1997) FASEB 11:248-255, Goochee et al (1991) 
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Bio/technology 9:1347-1355, Lopez et al (1997) Glycobiology 7:635-651, Opdenakker 
et al (1993) FASEB J. 7:1330). 

[0107] The composition of the attached oligosaccharide for a secreted or membrane- 
bound glycoprotein is dictated by the structure of the protein and by the post- 
radiational processing events that occur in the endoplasmic reticulum and Golgi 
apparatus of the host cell. Since the secretory processing machinery in mammalian cells 
differs from that in insect cells, glycoproteins with very different carbohydrate 
structures are produced by these two host cells (Jarvis et al (1995) Virology 212:500- 
511, Maru et al (1996) J. Biol Chem. 271 : 16294-16299, Altmann et al (1996) Trends 
in Glycoscience and Glycotechnology 8:101-1 14). These differences in carbohydrate 
structure can have dramatic effects on the in vitro and in vivo properties of the resulting 
glycoprotein. For example, the in vitro activity of human thyrotropin (hTSH) expressed 
in insect cells was five times higher than the activity of the same glycoprotein produced 
from mammalian Chinese hamster ovary (CHO) cells (Grossman et al (1997) 
Endocrinology 138:92-100). However, the in vivo activity of the insect cell-derived 
product was substantially lower due to its rapid clearance from injected rats. The drop 
in in vivo hTSH activity was linked to the absence of complex-type oligosaccharides 
terminating in sialic acid in the insect cell product (Grossman et al (1997) 
Endocrinologyl 138:92-100). 

[0108] N-glycosylation is highly significant to glycoprotein structure and function. 
In insect and mammalian cells N-glycosylation begins in the endoplasmic reticulum 
(ER) with the addition of the oligosaccharide, Glc 3 Man 9 GlcNAc 2 onto the asparagine 
(Asn) residue in the consensus sequence Asn-X-Ser/Thr (Moremen, et al (1994) 
Glycobiology 4:1 13-125, Varki et al (1993) Glycobiology 3(2):97-130, Altmann et al 
(1996) Trends in Glycoscience and Glycotechnology 8:101-1 14). As the glycoprotein 
passes through the ER and Golgi apparatus, enzymes trim and add different sugars to 
this N-linked glycan. These carbohydrate modification steps can differ in mammalian 
and insect hosts. 

[0109] In mammalian cell lines, the initial trimming steps are followed by the 
enzyme-catalyzed addition of sugars including N-acetylglucosamine (GlcNAc), 
galactose (Gal), and sialic acid (SA) by the steps shown in Figure 2, and as described in 
Goochee et al (1991) Bio/technology 9:1347-1355. 
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[0110] In insect cells, N-linked glycans attached to heterologous and homologous 
glycoproteins comprise either high-mannose (Man 9 -5GlcNAc 2 ) or truncated 
(paucimannosidic) (Man 3 - 2 GlcNAc 2 ) oligosaccharides; occasionally comprising 
alpha{\, 6)-fucose (Figure 3; Jarvis et al. (1989) Mol. Cell. Biol. 9:214-223, Kuroda et 
al. (1990) Virology 174:418-329, Marz et al. (1995) Glycoproteins 543-563, Altmann 
et al. (1996) Trends in Glycoscience and Glycotechnology 8:101-1 14). These reports 
primarily directed to Sf-9 or Sf-21 cells from Spodopterafrugiperda, indicated that 
insect cells could trim N-linked oligosaccharides but could not elongate these trimmed 
structures to produce complex carbohydrates. Reports from other insect cell lines, 
including Tricoplusia ni (T. ni; High Five™) and Estigmena acrea (Ea-4), indicated the 
presence of limited levels of partially elongated hybrid (structures with one terminal 
Man branch and one branch with terminal Gal, GlcNAc, or another sugar; Figure 4a) 
and complex (structures with two non-Man termini; Figure 4b) N-linked 
oligosaccharides (Oganah et al. (1996) Bio/Technology 14:197-202, Hsu et al. (1997) J. 
Biol. Chem. 272:9062-9070). Low levels of GlcNAc transferase I and II (GlcNAc TI 
and TIT), fucosyltransferase, mannosidases I and H, and Gal transferase (Gal T) have 
been reported in these insect cells; indicating a limited capability for production of 
these hybrid and complex N-linked oligosaccharides in these cells (Velardo et al. 
(1993) J. Biol. Chem. 268:17902-17907, Altmann et al. (1996) Trends in Glycoscience 
and Glycotechnology 8:101-1 14, van Die et al. (1996) Glycobiology 6:157-164). 
[01 1 1] However; most insect cell derived glycoproteins lack complex N-glycans. 
This absence maybe attributed to the presence of the hexosaminidase N- 
acetylglucosaminidase that cleaves GlcNAc attached to the alpha{\, 3) Man branch to 
generate paucimannosidic oligosaccharides (Licari et al. (1993) Biotech. Prog. 9:146- 
152, Altmann et al. (1995) J. Biol. Chem. 270:17344-17349). Chemicals have been 
added in an attempt to inhibit this glycosidase activity, but significant levels of 
paucimannosidic structures remain even in the presence of these inhibitors (Wagner et 
al. (1996) J. Virology 70:4103-4109). 

[0112] Manipulating carbohydrate processing in insect cells has been attempted; and 
in mammalian cells, the expression of sialyltransferases, galactosyltransferases and 
other enzymes is well established in order to enhance the level of oligosaccharide 
attachment (see U.S. Patent No. 5,047,335). However, in these cases, the presence of 
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the necessary donor nucleotide substrates, most significantly the sialylation nucleotide, 
CMP-sialic acid, in the proper subcellular compartment has been assumed. Attempts to 
manipulate carbohydrate processing have been made by expressing single transferases 
such as N-Acetylglucosamine transferase I (GlcNAc Tl), galactose transferase (GAL 
T), or sialyltransferase (Lee et al. (1989) J. Biol. Chem. 264:13848-13855, Wagner et 
al. (1996) Glycobiology 5:165-175, Jarvis et al. (1996) Nature Biotech. 14:1288-1292, 
Hollister et al. (1998) Glycobiology 5:473-480, Smith et al. (1990) J. Biol. Chem. 
265:6225-6234, Grabenhorst et al. (1995) Eur. J. Biochem. 232:718-725). Introduction 
of a mammalian beta{\, 4)-GalT using viral vectors (Jarvis et al. (1995) Virology 
212:500-511) or stably-transformed cell lines (Hollister et al. (1998) Glycobiology 
5:473-480) indicates that both approaches can enhance the extent of complex 
glycosylation of foreign glycoproteins expressed in insect cells. GlcNAcTl co- 
expression can increase the number of recombinant glycoproteins with oligosaccharides 
containing GlcNAc on the Man alpha{\, 3) branch (Jarvis et al. (1996) Nature Biotech. 
14:1288-1292, Jarvis et al. (1995) Virology 212:500-511, Hollister et al. (1998) 
Glycobiology 5:473-480; Wagner et al. (1996) Glycobiology 6:165-175). 
[0113] However, the production of complex carbohydrates comprising sialic acid 
has not been observed in these studies. Sialylation of a single recombinant protein 
(plasminogen) produced in baculovirus-infected insect cells has been reported 
(Davidson et al. (1990) Biochemistry 29:5584-5590), but findings appear to be specific 
to this glycoprotein. Conversely, many reports indicate the complete absence of any 
attached sialic acid on glycoproteins from all insect cell lines tested to date (Voss et al. 
(1993) Eur. J. Biochem. 217:913-919, Jarvis et al. (1995) Virology 212:500-51 1, Marz 
et al. (1995) Glycoproteins 543-563, Altmann et al. (1996) Trends in Glycoscience and 
Glycotechnology 8:101-1 14, Hsu et al. (1997) J. Biol. Chem. 272:9062-9070). 
[0114] The reason for this absence of sialylated glycoproteins was initially puzzling 
since polysialic acid structures were obtained in Drosophila embryos (Roth et al. 
(1992) Science 256:673-675). However, as demonstrated herein, it is now evident that 
insect cell lines generate very little sialic acid as compared to mammalian CHO cells 
(See Figure 16). With very little sialic acid, the insect cells cannot generate the donor 
nucleotide CMP-sialic acid essential for sialylation. A similar lack or limitation in 
donor nucleotide substrates may be observed in other eukaryotes as well. Thus, the co- 
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expression of sialyltransferase and other transferases must be accompanied by the 
intracellular generation of the proper donor nucleotide substrates and the proper 
acceptor substrates in order for the production of sialylated and other complex 
glycoproteins in eukaryotes. In addition, sialic acid and CMP-sialic acid are not 
permeable to cells so these substrates can not be provided directly to the medium of the 
cultures (Bennett et al (1981)7. Cell. Biol 55:1-15). 

[0115] The manipulation of post-translational processing is particularly relevant to 
biotechnology since recombinant DNA products generated in different hosts are usually 
identical at the amino acid level and differ only in the attached carbohydrate 
composition (Goochee et al (1991) Bio/technology 9:1347-1355). Engineering 
carbohydrate pathways is useful to make recombinant DNA technology more versatile 
and expand the number of hosts that can generate particular glycoforms. This 
flexibility could ultimately lower biotechnology production costs since host efficiency 
would be the primary factor dictating which expression system is chosen rather than a 
host's capacity to produce a specific glycoform. Furthermore, carbohydrate engineering 
is useful to tailor a glycoprotein to include specific oligosaccharides that could alter 
biological activity, structural properties or circulatory targets. Such carbohydrate 
engineering efforts will provide a greater variety of recombinant glyco-products to the 
biotechnology industry. 

[0116] Glycoproteins containing sialylated oligosaccharides would have improved in 
vivo circulatory half-lives that could lead to their increased utilization as vaccines and 
therapeutics. In particular, complex sialylated glycoproteins from insect cells would be 
more appropriate biological mimics of native mammalian glycoproteins in molecular 
recognition events in which sialic acid plays a role. 

[0117] Therefore, manipulating carbohydrate processing pathways in insect and 
other eukaryotic cells so that the cells produce complex sialylated glycoproteins is 
useful for enhancing, the value of heterologous expression systems and increasing the 
application of heterologous cell expression products as vaccines, therapeutics, and 
diagnostic tools; for increasing the variety of glycosylated products to be generated in 
heterologous hosts; and for lowering biotechnology production costs, since particular 
expression systems can be selected based on efficiency of production rather than the 
capacity to produce particular product glycoforms. 



SUMMARY OF THE INVENTION 
[0118] Compositions and methods for producing glycoproteins having sialylated 
oligosaccharides are provided. The compositions of the invention comprise enzymes 
involved in carbohydrate processing and production of nucleotide sugars, nucleotide 
sequences encoding such enzymes, and cells transformed with these nucleotide 
sequences. The compositions of the invention are useful in methods for producing 
complex sialylated glycoproteins in cells of interest including, but not limited to, 
mammalian cells and non-mammalian cells (e.g., insect cells). 

[0119] The sialylation process involves the post-translational addition of a donor 
substrate, cytidine monophosphate-sialic acid (CMP-SA) onto a specific acceptor 
carbohydrate (GalGlcNAcMan-R) via an enzymatic reaction catalyzed by a 
sialyltransferase in the Golgi apparatus. Since one or more of these three reaction 
components (i.e., acceptor, donor substrate, and the enzyme sialyltransferase) is 
limiting or absent in certain cells of interest, methods are provided to enhance the 
production of the limiting components. Polynucleotide sequences encoding the 
enzymes used according to the methods of the invention are known or novel bacterial 
invertebrate, fungal, or mammalian sequences and/or fragments or variants thereof, 
that are optionally identified using bioinformatics searches. According to one 
embodiment of the invention, completion of the sialylation reaction is achieved by 
expressing a sialyltransferase enzyme, or a fragment or variant thereof, in the presence 
of acceptor and/or donor substrates. The invention also provides an assay for 
sialylation, wherein the structures and compositions of N-linked oligosaccharides 
attached to a model secreted glycoprotein, (e.g., transferrin), is elucidated using 
multidimensional chromatography. 

[0120] Cells of interest that have been recombinantly engineered to produce new 
forms of sialylated glycoproteins, higher concentrations of sialylated glycoproteins, 
and/or elevated concentrations of donor substrates (.g., nucleotides sugars) required for 
sialylation, as well as kits for expression of sialylated glycoproteins are also provided. 



BRIEF DESCRIPTION OF THE DRAWINGS 
[0121] Figure 1 depicts the typical differences in insect and mammalian 
carbohydrate structures. 

[0122] Figure 2 depicts the enzymatic generation of a complex sialylated 
carbohydrate in mammalian cells. 

[0123] Figure 3 depicts a Paucimannosidic oligosaccharide. 

[0124] Figure 4a depicts a hybrid glycan from Estigmena acrea (Ea-4) insect cells. 
[0125] Figure 4b depicts a complex glycan from Estigmena acrea (Ea-4) insect 
cells. 

[0126] Figure 5 depicts the nucleotide sugar production pathways in mammalian and 
E. coli cells leading to sialylation. 

[0127] Figure 6 depicts a chromatogram of labeled oligosaccharides separated by 
reverse phase High Performance Liquid Chromatography (HPLC) on an ODS-silica 
column. Using this technique, oligosaccharides are fractionated according to their 
carbohydrate structures. Panel "L" represents cell lysate fractions and panel "S" 
represents cell supernatant fractions. 

[0128] Figure 7 depicts the structure of Oligosaccharide G. 

[0129] Figure 8 depicts the glycosylation pathway in Trichoplusia ni insect cells 

(High Five™ cells; Invitrogen Corp., Carlsbad, CA, USA). 

[0130] Figure 9 depicts the chromatogram of a Galactose-transferase assay 

following High Performance Anion Exchange Chromatography (HPAEC), as described 

in the Examples and references cited therein. 

[0131] Figure 10 depicts the chromatogram of a 2,3-Sialyltransferase assay 
following Reverse Phase-High Performance Liquid Chromatography (RP-HPLC), as 
described in the Examples. 

[0132] Figure 1 1 depicts the results of a Galactose-transferase (Gal-T) assay of 
insect cell lysates performed using a Europium (Eu +3 )-labeled Ricinus cummunis lectin 
(RCA 120) probe; which specifically binds Gal or GalNAc oligosaccharide structures 
as described in the Examples. Each column represents the Gal-T activity in a given 
sample; Column (A) represents boiled T. ni cell lysates, Column (B) represents normal 
T. ni cell lysates, Column (C) represents activity in 0.5 mU of enzyme standard, 
Column (D) represents lysate from T. ni cells infected with a baculovirus coding for 
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GalT, Column (E) represents lysates from Sf-9 cells stably transfected with the GalT 
gene. Figure 12 depicts the product of reacting UDP-Gal-6-Naph with Dans-AE- 
GlcNAc in the presence of GalT. 

[0133] Figure 12 depicts the reaction products resulting from incubation of UDP- 
Gal-6-Naph and Dans-AE-GlcNAc in the presence of Galactose-transferase, as 
described in the "Experimental" section below. 

[0134] Figure 1 3 depicts the distinguishing emission spectra of GalT assay reactants 
and products, as described in the "Experimental" section below. Irradiation of the 
naphthyl group in UDP-Gal-6-Naph at 260-290 nm ("ex") results in an emission peak 
at 320-370 nm ("em" dotted line) while irradiation of the Galactose-transferase reaction 
products at these same low wavelengths results in energy transfer to the dansyl group 
and an emission peak at 500-560 nm ("em" solid line). 
[0135] Figure 1 4 depicts the oxidation reaction of sialic acid. 
[0136] Figure 1 5 schematically depicts a new GlcNAc Tl assay utilizing a synthetic 
6-aminohexyl glycoside of the trimannosyl N-glycan core structure labeled with DTP A 
(Diethylenetriaminepentaacetic acid) and complexed with Eu +3 (see "Experimental" 
section below). This substrate is incubated with insect cell lysates or positive controls 
containing GlcNAc Tl and UDP-GlcNAc. Chemical inhibitors are added to minimize 
background N-acetylglucosaminidase activity. After the reaction, an excess of Crocus 
lectin CVL (Misaki et al. (1997) J. Biol. Chem. 272:25455-25461), which specifically 
binds the trimannosyl core, is added. The amount of lectin required to bind all the 
trimannosyl glycoside (and hence all the Eu +3 label) in the absence of any GlcNAc 
binding is predetermined. Following an ultrafiltration step, the glycoside modified with 
GlcNAc (not binding CVL) appears in the filtrate. Measurement of the Eu +3 
fluorescence in the filtrate reflects the level of GlcNAc Tl activity in the culture 
lysates. 

[0137] Figure 1 6 depicts a chromatogram of sialic acid levels in SF9 insect cells and 
CHO (Chinese hamster ovary) cells. In the panel labeled "Sf-9 Free Sialic Acid Levels" 
the known sialic acid standard elutes just prior to 10 minutes, while no corresponding 
sialic acid peak can be detected (above background levels) in Sf-9 cells. In the panel 
labeled "CHO sialic acid levels" the sialic acid standard elutes at approximately 9 
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minutes, while bound and free (released by acid hydrolysis) sialic acid peaks are 
observed at similar elution positions. 

[0138] Figure 17 depicts how selective inhibition of N-acetylglucosaminidase allows 
for production of complex oligosaccharide structures. 

[0139] Figure 18 depicts ethidium bromide-stained agarose gels following 
electrophoresis of PCR amplification products from Sf9 genomic DNA or High Five™ 
(Invitrogen Corp., Carlsbad, CA, USA) cell cDNA templates using degenerate primers 
corresponding to three different regions conserved within N-acetylglucosaminidases. 
[0140] Figure 19 depicts two potential specific chemical inhibitors of N- 
acetylglucosaminidase. 

[0141] Figure 20 schematically depicts that the overexpression of various 

glycosyltransferases leads to greater production of oligosaccharide acceptor substrates. 

[0142] Figure 21 depicts three possible N-glycan acceptor structures which include 

the terminal Gal (G) acceptor residue required for subsequent sialylation. 

[0143] Figure 22 depicts a structure of CMP-sialic acid (CMP-SA). 

[0144] Figure 23 depicts a metabolic pathway for ManNAc (N-acetylmannosamine) 

from glucosamine and N-acetylglucosamine (GlcNAc). 

[0145] Figure 24 depicts a ManNAc (N-acetylmannosamine) to sialic acid metabolic 
pathway. 

[0146] Figure 25 depicts the formation of CMP-sialic acid (CMP-SA) catalyzed by 
CMP-SA synthetase. 

[0147] Figure 26 depicts detection of purified (P) transferrin (hTf) or transferrin 
from unpurified insect cell lysates (M) following separation on an SDS-PAGE gel, as 
described the Examples. 

[0148] Figure 27 depicts the nucleotide sequence of human aldolase (SEQ ID 
NO.l). 

[0149] Figure 28 depicts the amino acid sequence of human aldolase (SEQ ID NO: 

2) encoded by the sequence shown in Figure 27 (SEQ ID NO: 1). 

[0150] Figure 29 depicts the nucleotide sequence of human CMP-SA synthetase 

(cytidine monophosphate-sialic acid synthetase) (SEQ ID NO: 3). 

[0151] Figure 30 depicts the amino acid sequence of human CMP-SA synthetase 

(SEQ ID NO: 4) encoded by the sequence shown in Figure 29 (SEQ ID NO: 3). 




[0152] Figure 31 depicts the nucleotide sequence of human sialic acid synthetase 
(human SA-synthetase; human SAS) (SEQ ID NO: 5). 

[0153] Figure 32 depicts the amino acid sequence of human SA-synthetase (SAS) 
(SEQ ID NO: 6) encoded by the sequence shown in Figure 31 (SEQ ID NO: 5). 
[0154] Figure 33 depicts the types and quantities of oligosaccharide structures found 
on recombinant human transferrin in the presence and absence of Gal T overexpression. 
[0155] Figure 34 depicts bacterial and mammalian sialic acid metabolic pathways. 
[0156] Figure 35 A-C depicts an alignment of the polypeptide (SEQ ID NO: 6) 
encoded by the human SAS polynucleotide open-reading frame (SEQ ID NO:5). 
[0157] Figure 35 D depicts the amino acid sequence homology between human 
SAS (top sequence) (SEQ ID NO: 6) and bacterial sialic acid synthetase (NeuB) 
(bottom sequence) (SEQ ID NO: 8). 

[0158] Figure 36 (A) depicts an autoradiogram of human sialic acid synthetase gene 
products following gel electrophoresis. The lanes labeled "In Vitro" represent in vitro 
transcription and translation products of SAS cDNA (amplified via polymerase chain 
reaction (PCR)). Lane 1 ("pA2") depicts a negative control reaction in which pA2 
plasmid (without the SAS cDNA) was PCR amplified, transcribed, translated, and 
radiolabled. Lane 2 ("pA2-SAS ") depicts a sample reaction in which pA2-S AS 
plasmid (containing the human SAS cDNA) was PCR amplified, transcribed, 
translated, and radiolabeled. Lane 3 ("Marker") depicts radiolabeled protein standards 
migrating at approximately 66, 46, 30, 21.5, and 14.3 kD. The lanes labeled "Pulse 
Label" show radioactive 35 S pulse labeling of polypeptides from insect cells infected by 
virions not containing or containing the human SAS cDNA. Lane 4 ("A35") depicts a 
negative control reaction of radiolabled polypeptides from insect cells infected with 
virions not containing the SAS cDNA. Lane 5 ("AcSAS") depicts a sample reaction of 
radiolabeled polypeptides from insect cells infected with baculovirus containing the 
human SAS cDNA. Figure 36 (B) depicts an RNA (Northern) blot of human tissues 
(spleen, thymus, prostate, testis, ovary, small intestine, peripheral blood lymphocytes 
(PBL), colon, heart, brain, placenta, lung, liver, skeletal muscle, kidney, and pancreas) 
probed for sialic acid synthetase RNA transcripts. Transcript sizes (in kilobases) are 
indicated by comparison to the scale on the left side. 
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[0159] Figure 37 depicts chromatograms indicating the in vivo sialic acid content of 
various cells as monitored following DMB derivitization and reverse phase HPLC 
separation. Figure 37 (A) depicts the sialic acid content of lysed cell lines after 
filtration through a 10,000 MWCO membrane. The cell lines analyzed were Sf-9 
(insect) cells in standard media, SF-9 cells supplemented with 10% FBS (fetal bovine 
serum), or CHO (Chinese Hamster Ovary) cells. The original chromatogram values 
have been divided by protein concentration to normalize chromatograms. The 
standards shown are NeuSAc at 1000 finol, NeuSGc at 200 finol, and KDN at 50 ftnol. 
Figure 37 (B) depicts a chromatogram of the sialic acid content of lysates from various 
Sf-9 cells. " AcSAS Infected' 1 cell lysates were from Sf-9 cells infected with 
baculovirus containing the human SAS cDNA. The NeuSAc and KDN "Standards" are 
shown at 1,000 finol concentrations. "A35 Infected" cell lysates are from Sf-9 infected 
by baculovirus not containing the SAS cDNA. "Uninfected" cell lysates are from 
normal Sf-9 cells not infected by any baculovirus. Original chromatogram values have 
been divided by protein concentration to normalize chromatograms. Figure 37 (C) 
depicts a chromatogram of the sialic acid content from lysates of Sf-9 grown in media 
supplemented by 10 mM ManNAc; cells were infected or not infected with baculovirus 
as shown in Figure 37 (B). Original chromatogram values have been divided by protein 
concentrations to normalize chromatograms. NeuSAc and KDN standards represent 
1,000 finol. Figure 37(D) HPAEC (high performance anion-exchange chromatography) 
analysis of lysates from Sf-9 cells infected with AcSAS or A35 baculovirus with and 
without aldolase treatment. Samples were diluted prior to column loading to normalize 
sialic acid quantities based on original sample protein concentration. NeuSAc standard 
is shown at 250 pmol and KDN standard is shown at 100 pmol. 
[0160] Figure 38 depicts chromatograms of in vitro assays for sialic acid 
phosphorylation activity. Assays were performed with and without alkaline 
phosphatase (AP) treatment. Figure 38 (A) depicts chromatogram results of a NeuSAc- 
9-phosphate assay performed using lysates from Sf-9 cells infected with the AcSAS 
baculovirus (containing the human SAS cDNA). KDN and NeuSAc standards are 
shown at 5000 finol. Figure 38 (B) depicts chromatogram results of a KDN-9- 
phosphate assay performed using lysates from Sf-9 cells infected with the AcSAS 
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baculovirus (containing the human SAS cDNA). KDN and Neu5Ac standards are 
shown at 5000 ftnol. 

[0161] Figure 39 depicts a chromatogram demonstrating production of sialylated 
nucleotides in SF-9 insect cells following infection with CMP-SA synthetase and SA 
synthetase containing baculoviruses. Sf-9 cells were grown in six well plates and 
infected with baculovirus containing CMP-SA synthase and supplemented with 10 mM 
ManNAc ("CMP" line), with baculovirus containing CMP-SA synthase and SA 
synthase plus 10 mM ManNAc supplementation ("CMP+SA" line), or with no 
baculovirus and no ManNAc supplementation ("SF9" line). 

DETAILED DESCRIPTION OF THE INVENTION 
[01 62] Compositions and methods for producing glycoproteins with sialylated 
oligosaccharides are provided. In particular, the carbohydrate processing pathways of 
cell lines of interest are manipulated to produce complex sialylated glycoproteins. Such 
sialylated glycoproteins find use as pharmaceutical compositions, vaccines, diagnostics, 
therapeutics, and the like. 

[01 63] Cells of interest include, but are not limited to, mammalian cells and non- 
mammalian cells, such as, for example, CHO, plant, yeast, bacterial, insect, and the 
like. The methods of the invention can be practiced with any cells of interest. By way 
of example, methods for the manipulation of insect cells are described fully herein. 
However, it is recognized that the methods may be applied to other cells of interest to 
construct processing pathways in any cell of interest for generating sialylated 
glycoproteins. 

[0164] Oligosaccharides on proteins are commonly attached to asparagine residues 
found within Asn-X-Ser/Thr consensus sequences; such asparagine-linked 
oligosaccharides are commonly referred to as "N-linked". The sialylation of N-linked 
glycans occurs in the Golgi apparatus by the following enzymatic mechanism: CMP- 
SA + GalGlcNAcMan-R sialyltransferase SAGalGlcNAcMan-R + CMP. The 
successful execution of this sialylation reaction depends on the presence of three 
elements: 1) the correct carbohydrate acceptor substrate (designated GalGlcNAcMan-R 
in the above reaction; where the acceptor substrate is a branched glycan, 
GalGlcNAcMan is comprised by at least one branch of the glycan, the Gal is a terminal 
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Gal, and R is an N-linked glycan); 2) the proper donor nucleotide sugar, cytidine 
monophosphate-sialic acid (CMP-SA); and 3) a sialyltransferase enzyme. Each of these 
reaction components is limiting or missing in insect cells (Hooker et al. (1997) 
Monitoring the glycosylation pathway of recombinant human interferon-gamma 
produced by animal cells , Hsu et al. (1997) J. Biol. Chem. 272:9062-9070, Jarvis et al. 
(1995) Virology 212:500-51 1, Jenkins et al. (1998) Cell Culture Engineering VI , 
Oganah et al. (1996) Bio/Technology 14:197-202). 

[0165] It will be apparent to those skilled in the art that where a cell of interest is 
manipulated according to the methods of the invention such that the cell produces a 
desired level of the donor substrate CMP-SA, and expresses a desired level of 
sialyltransferase; any oligosaccharide or monosaccharide, any compound containing an 
oligosaccharide or monosaccharide, any compatible aglycon (for example Gal- 
sphingosine), any asparagine (N)-linked glycan, any serine- or threonine-linked (O- 
linked) glycan, and any lipid containing a monosaccharide or oligosaccharide structure 
can be a proper acceptor substrate and can be sialylated within the cell of interest. 
[01 66] Accordingly, the methods of the invention may be applied to generate 
sialylated glycoproteins for which the acceptor substrate is not necessarily limited to the 
structure GalGlcNAcMan-R, although this structure is particularly recognized as an 
appropriate acceptor substrate structure for production of N-linked sialylated 
glycoproteins. Thus, according to the methods of the present invention, the acceptor 
substrate can be any glycan. Preferably, the acceptor substrate according to the 
methods of the invention is a branched glycan. Even more preferably, the acceptor 
substrate according to the methods of the invention is a branched glycan comprising a 
terminal Gal in at least one branch of the glycan. Yet even more preferably, the 
acceptor substrate according to the methoids of the invention has the structure 
GalGlcNAcMan in at least one branch of the glycan and the Gal is a terminal Gal. 
[0167] It will also be apparent to those skilled in the art that engineering the 
sialylation process into cells of interest according to the methods of the present 
invention requires the successful manipulation and integration of multiple interacting 
metabolic pathways involved in carbohydrate processing. These pathways include 
participation of glycosyltransferases, glycosidases, the donor nucleotide sugar (CMP- 
SA) synthetases, and sialic acid transferases. "Carbohydrate processing enzymes" of 
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the invention are enzymes involved in any of the glycosyltransfer, glycosidase, CMP- 
SA synthesis, and sialic acid transfer pathways. Known carbohydrate engineering 
efforts have generally focused on the expression of transferases (Lee et al (1989) J. 
Biol Chem. 264:13848-13855, Wagner et al (1996)7. Virology 70:4103-4109, Jarvis 
et al (1996) Nature Biotech. 14:1288-1292, Hollister et al (1998) Glycobiology 5:473- 
480, Smith et al (1990) J. Biol Chem. 265:6225-6234, Grabenhorst et al (1995) Eur. 
J. Biochem. 232:718-725; U.S. Patent No. 5,047,335; International patent application 
publication number WO 98/06835). However, it is recognized in this invention that the 
mere insertion of one or more transferases into cells of interest does not ensure 
sialylation, as there are generally insufficient levels of the donor (CMP-SA) and the 
acceptor substrates, particularly GalGlcNAcMan-R. 

[01 68] The methods of the present invention permit manipulation of glycoprotein 
production in cells of interest by enhancing the production of donor nucleotide sugar 
substrate (CMP-SA) and optionally, by introducing and expressing sialyltransferase 
and/or acceptor substrates. By "cells of interest" is intended any cells in which the 
endogenous CMP-SA levels are not sufficient for the production of a desired level of 
sialylated glycoprotein in that cell. The cell of interest can be any eukaryotic or 
prokaryotic cell. Cells of interest include, for example, insect cells, fungal cells, yeast 
cells, bacterial cells, plant cells, mammalian cells, and the like. Human cells and cell 
lines are also included in the cells of interest and may be utilized according to the 
methods of the present invention to, for example, manipulate sialylated glycoproteins in 
human cells and/or cell lines, such as, for example, kidney, liver, and the like. By 
"desired level" is intended that the quantity of a biochemical comprised by the cell of 
interest is altered subsequent to subjecting the cell to the methods of the invention. In 
this manner, the invention comprises manipulating levels of CMP-SA and/or sialylated 
glycoprotein in the cell of interest. In a preferred embodiment of the invention, 
manipulating levels of CMP-SA and sialylated glycoprotein comprise increasing the 
levels to above endogenous levels. It is recognized that the increase can be from a non- 
detectable level to any detectable level; or the increase can be from a detected 
endogenous level to a higher level. 

[0169] According to the present invention, production of the acceptor substrate is 
achieved by optionally screening a variety of cell lines for desirable processing 



16 



enzymes, suppressing unfavorable cleavage reactions that generate truncated 
carbohydrates, and/or by enhancing expression of desired glycosyltransferase enzymes 
such as galactose transferase. Methods of enhancing expression of certain carbohydrate 
processing enzymes, including but not limited to, glycosyltransferases, are described in 
U.S. Patent No. 5,047,335 and International patent application publication number WO 
98/06835, the contents of which are herein incorporated by reference. 
[0170] According to the present invention, production of the donor substrate, CMP- 
SA, may be achieved by adding key precursors such as N-acetylmannosamine 
(ManNAc), N-acetylglucosamine (GlcNAc) and glucosamine to cell growth media, by 
enhancing expression of limiting enzymes in CMP-SA production pathway in the cells, 
or any combination thereof. 

[0171] For purposes of the present invention, by "enhancing expression" is intended 
to mean that the translated product of a nucleic acid encoding a desired protein is higher 
than the endogenous level of that protein in the host cell in which the nucleic acid is 
expressed. In a preferred embodiment of the invention, the biological activity of a 
desired carbohydrate processing enzyme is increased by enhancing expression of the 
enzyme. 

[0172] For the purposes of the invention, by "suppressing activity" is intended to 
mean decreasing the biological activity of an enzyme. In this aspect, the invention 
encompasses reducing the endogenous expression of the enzyme protein, for example, 
by using antisense and/or ribozyme nucleic acid sequences corresponding to the amino 
acid sequences of the enzyme; gene knock-out mutagenesis; and/or by inhibiting the 
activity of the enzyme protein, for example, by using chemical inhibitors. 
[0173] By "endogenous" is intended to mean the type and/or quantity of a biological 
function or a biochemical composition that is present in a naturally occurring or 
recombinant cell prior to manipulation of that cell according to the methods of the 
invention. 

[0174] By "heterologous" is intended to mean the type and/or quantity of a 
biological function or a biochemical composition that is not present in a naturally 
occurring or recombinant cell prior to manipulation of that cell by the methods of the 
invention. 
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[0175] For purposes the present invention, by "a heterologous polypeptide or 
protein" is meant as a polypeptide or protein expressed (i.e. synthesized) in a cell 
species of interest that is different from the cell species in which the polypeptide or 
protein is normally expressed (i.e. expressed in nature). 

[0176] Methods for determining endogenous and heterologous functions and 
compositions relevant to the invention are provided herein; and otherwise encompass 
those methods known in the art. 

Generation of Acceptor Carbohydrate Substrate: GalGlcNAcMan-R: 
[0177] According to the methods of the present invention, production of the 
acceptor substrate glycan GalGlcNAcMan-R, is particularly desirable for the sialylation 
reaction of N-linked glycoproteins, moreover the terminal Gal is required. Thus, in one 
embodiment of the invention the cells of interest are manipulated (using techniques 
described herein or otherwise known in the art) to contain this substrate. For example, 
for insect cells which principally produce truncated carbohydrates terminating in Man 
or GlcNAc, such cells may routinely be manipulated to produce a significant fraction of 
complex oligosaccharides terminating in Gal. Three non limiting, non-exclusive 
approaches that may be routinely applied to produce a significant fraction of complex 
oligosaccharides terminating in Gal include: (1) developing screening assays to analyze 
a selection of insect cell lines for the presence of particular carbohydrate processing 
enzymes; (2) elevating production of Gal-terminated oligosaccharides by expressing 
specific enzymes relevant to carbohydrate processing pathways; and (3) suppressing 
carbohydrate processing pathways that produce truncated N-linked glycans which 
cannot serve as acceptors in downstream glycosyltransferase reactions. 
[0178] Thus, in one embodiment, to produce GalGlcNAcMan-R acceptor substrates 
according to the methods of the invention, cell lines of interest are initially, and 
optionally, screened to identify cell lines with the desired endogenous carbohydrate 
production for subsequent metabolic manipulations. More particularly, the screening 
process includes characterizing cell lines for glycosyl transferase activity using 
techniques described herein or otherwise known in the art. Furthermore, it is 
recognized that any screened cell line could generate some paucimannosidic 
carbohydrates. Accordingly, the screening process also includes using techniques 
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described herein or otherwise known in the art to characterize cell lines for particular 
glycosidase activity leading to production of paucimannosidic structures. 
[0179] Thus, in another embodiment, for the production of the acceptor substrates, 
the invention encompasses utilizing methods described herein or otherwise known in 
the art to enhance the expression of one or more transferases. Such methods include, 
but are not limited to, methods that enhance expression of Gal T, GlcNAc -TI and -Til 
or any combination thereof; for example, as described in International patent 
application publication number WO 98/06835 and U.S. Patent No. 5,047,335. 
[0180] Thus, in another embodiment, concentrations of acceptor substrates are 
increased by using methods described herein or otherwise known in the art to suppress 
the activity of one or more endogenous glycosidases. By way of example, an 
endogenous glycosidase, the activity of which may be suppressed accoreding to the 
methods of the invention includes, but is not limited to, the hexosaminidase, N- 
acetylglucosaminidase (an enzyme that degrades the substrate required for 
oligosaccharide elongation). 

[0181] Thus, the invention encompasses enhancing metabolic pathways that produce 
the desired acceptor carbohydrates and/or suppressing those pathways that produce 
truncated acceptors. 

Characterizing cell lines using enzyme scr eening assay 

[0182] The cell lines of interest produce different N-glycan structures. Thus, such 
cells can routinely be screened using techniques described herein or otherwise known in 
the art to determine the presence of carbohydrate processing enzymes of interest. In 
insect cells, for example, different insect cell lines produce very different N-glycan 
structures (Jarvis et al. (1905) Virology 212:500-51 1, Hsu et al. (1997) J. Biol. Chem. 
272:9062-9070, Nishimura et al. (1996) Bioorg. Med. Chem. 4:91-96). However, only 
a few cell lines have been characterized, in part due to the lack of efficient screening 
assays. The present invention provides methods implementing fluorescence energy 
transfer and Europium fluorescence assays to screen a selection of different cells of 
interest, such as, for example, insect cell lines for the presence of critical carbohydrate 
processing enzymes. 
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[01831 Analytical bioassays described herein or otherwise known in the art are also 
provided according to the methods of the present invention to detect the presence of 
favorable carbohydrate processing enzymes, including, but not limited to, galactosyl 
transferase (Gal T), GlcNAc transferase I (GlcNAc T I), and sialyltransferase; and to 
detect undesirable enzymes including, but not limited to, N-acetylglucosaminidase. 
[0184] Where the cells of interest are insect cells, it will be immediately apparent 
that substantial diversity exists among established insect cell lines due to the range of 
species and tissues from which these lines were derived. Many of these lines can 
routinely be infected by the baculovirus, Autographa californica nuclear polyhidrosis 
virus (AcMNPV), and used for the production of heterologous proteins. However, only 
a few cell lines are routinely used for recombinant protein production using techniques 
described herein or otherwise kncfwn in the art. These cell lines will be immediately 
apparent by one skilled in the art. It is recognized that any cell line can be screened for 
specific carbohydrate processing enzymes, and manipulated for the purposes of the 
present invention. Examples of such cell lines include, but are not limited to, insect 
cell lines, including but not limited to, Spodoptera frugiperda (e.g. Sf-9 or Sf-21 cells), 
Trichoplusia ni (T. ni), and Estigmene acrea (Ea4). Spodoptera frugiperda lines (Sf-9 
or Sf-21) are the most widely used cell lines and a significant amount information is 
known about the oligosaccharide processing in these cells. Trichoplusia ni (e.g. High 
Five™ cells; Invitrogen Corp., Carlsbad, CA, USA) cells have been shown to secrete 
high yields of heterologous proteins with attached hybrid and complex N-glycans 
(Davis et al. (1993) In Vitro Cell. Dev. Biol. 29:842-846). Estigmena acrea (Ea-4) 
have been used to generate hybrid and complex N-linked oligosaccharides terminating 
in GlcNAc and Gal residues (Oganah et al. (1996) Bio/Technology 14:197-202). 
[0185] Drosophila Schneider S2 cell lines represent another insect cell line used for 
the production of heterologous proteins. Though these cells cannot be infected by the 
AcNPV expression vector, they are used for production of heterologous proteins via an 
alternative technology known in the art. These cell lines represent other insect cell line 
candidates whose glycosylation processing characteristics may be modified to include 
sialylation. 

[01 86] In insect cells, paucimannosidic structures are produced by a membrane- 
bound N-acetylglucosaminidase, which removes terminal GlcNAc residues from the 




alpha{\3) arm of the trimannosyl core (Altmann et al (1995) J. Biol Chem. 
270:17344-17349). This trimannosyl core structure lacks the proper termini required for 
conversion of side chains to sialylated complex structures; therefore, suppression of the 
N-acetylglucosaminidase activity can reduce or eliminate the formation of these 
undesired oligosaccharide structures, as illustrated in Figure 17. 
[0187] To reduce the N-acetylglucosaminidase activity in the target insect cell 
line(s), the invention provides vectors encoding N-acetylglucosaminidase or other 
glucosaminidase cDNAs in the antisense orientation and/or, vectors encoding 
ribozymes and/or, vectors containing sequences capable of "knocking out" the N- 
acetylglucosaminidase other glucosaminidase genes via homologous recombination. 
Expression plasmids described herein or otherwise known in the art are constructed 
using techniques known in the art to produce stably-transformed insect cells that 
constitutively express the antisense construct and/or ribozyme construct to suppress 
translation of N-acetylglucosaminidase other glucosaminidases or alternatively, to use 
homologous recombination techniques known in the art are to "knock-out" the N- 
acetylglucosaminidase other glucosaminidase genes. Particular sequences to be used in 
the antisense and/or ribozyme construction are described herein, for example, in 
Example 4. Techniques described herein or otherwise known in the art may be 
routinely applied to analyze N-linked oligosaccharide structures and to determine if N- 
glycan processing is altered and of the number of paucimannosidic structures in these 
cells is reduced. 

[0188] Antisense technology can be used to control gene expression through 
antisense DNA or RNA or through triple-helix formation. Antisense techniques are 
discussed, for example, in Okano, J. Neurochem. 56: 560 (1991); 
"Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 
Raton, FL (1988). Antisense technology can be used to control gene expression through 
antisense DNA or RNA, or through triple-helix formation. Antisense techniques are 
discussed for example, in Okano, J., Neurochem. 56:560 (1991); 
Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 
Raton, FL (1988). Triple helix formation is discussed in, for instance Lee et al., 
Nucleic Acids Research 6: 3073 (1979); Cooney et al., Science 241: 456 (1988); and 
Dervan et al., Science 251 : 1360 (1991). The methods are based on binding of a 
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polynucleotide to a complementary DNA or RNA. For example, the 5' coding portion 
of a polynucleotide that encodes the amino terminal portion of N- 
acetylglucosaminidase and/or other glucosaminidases may be used to design antisense 
RNA oligonucleotides of from about 10 to 40 base pairs in length. A DNA 
oligonucleotide is designed to be complementary to a region of the gene involved in 
transcription thereby preventing transcription and the production of N- 
acetylglucosaminidase and/or other glucosaminidases. The antisense RNA 
oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the mRNA 
molecule into N-acetylglucosaminidase and/or other glucosaminidase polypeptides. 
The oligonucleotides described above can also be delivered to cells such that the 
antisense RNA or DNA may be expressed in vivo to inhibit production of N- 
*S acetylglucosaminidase and/or other glucosaminidases. 

ijl [0189] In one embodiment, the N-acetylglucosaminidase and/or other 

! 3 glucosaminidase antisense nucleic acids of the invention are produced intracellularly by 

'P transcription from an exogenous sequence. For example, a vector or a portion thereof, 

T is transcribed, producing an antisense nucleic acid (RNA) of the invention. Such a 

\t vector would contain a sequence encoding a N-acetylglucosaminidase and/or other 

if glucosaminidase antisense nucleic acids. Such a vector can remain episomal or 

□ become chromosomally integrated, as long as it can be transcribed to produce the 

j " desired antisense RNA. Such vectors can be constructed by recombinant DNA 

technology methods standard in the art. Vectors can be plasmid, viral, or others know 
in the art, used for replication and expression in insect, yeast, mammalian, and plant 
cells. Expression of the sequences encoding N-acetylglucosaminidase and/or other 
glucosaminidases, or fragments thereof, can be by any promoter known in the art to act 
in insect, yeast, mammalian, and plant cells. Such promoters can be inducible or 
constitutive. Such promoters include, but are not limited to, the baculovirus polyhedrin 
promoter (Luckow et al (1993) Curr. Opin. Biotech. 4:564-572, Luckow et al (1995)), 
the SV40 early promoter region (Bernoist and Chambon, Nature 29:304-310 (1981), 
the promoter contained in the 3' long terminal repeat of Rous sarcoma virus 
(Yamamoto et al., Cell 22:787-797 (1980), the herpes thymidine promoter (Wagner et 
al., Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445 (1981), the regulatory sequences of the 
metallothionein gene (Brinster, et al., Nature 296:39-42 (1982)), etc. 
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[0190] The antisense nucleic acids of the invention comprise sequences 
complementary to at least a portion of an RNA transcript of N-acetylglucosaminidase 
and/or other glucosaminidase genes. However, absolute complementarity, although 
preferred, is not required. A sequence "complementary to at least a portion of an 
RNA ," referred to herein, means a sequence having sufficient complementarity to be 
able to hybridize with the RNA, forming a stable duplex; in the case of double stranded 
N-acetylglucosaminidase and/or other glucosaminidase antisense nucleic acids, a single 
strand of the duplex DNA may thus be tested, or triplex formation may be assayed. 
The ability to hybridize will depend on both the degree of complementarity and the 
length of the antisense nucleic acid Generally, the larger the hybridizing nucleic acid, 
the more base mismatches with a N-acetylglucosaminidase and/or other 
glucosaminidase RNAs it may contain and still form a stable duplex (or triplex as the 
case may be). One skilled in the art can ascertain a tolerable degree of mismatch by use 
of standard procedures to determine the melting point of the hybridized complex. 
[0191] Oligonucleotides that are complementary to the 5* end of the message, e.g., 
the 5' untranslated sequence up to and including the AUG initiation codon, should 
work most efficiently at inhibiting translation. However, sequences complementary to 
the 3* untranslated sequences of mRNAs have been shown to be effective at inhibiting 
translation of mRNAs as well. See generally, Wagner, R., 1994, Nature 372:333-335. 
Thus, oligonucleotides complementary to either the 5'- or 3'- non- translated, non- 
coding regions of N-acetylglucosaminidase and/or other glucosaminidases, could be 
used in an antisense approach to inhibit translation of endogenous N- 
acetylglucosaminidase and/or other glucosaminidase mRNAs. Oligonucleotides 
complementary to the 5* untranslated region of the mRNA should include the 
complement of the AUG start codon. Antisense oligonucleotides complementary to 
mRNA coding regions are less efficient inhibitors of translation but could be used in 
accordance with the invention. Whether designed to hybridize to the 5'-, 3'- or coding 
region of N-acetylglucosaminidase and/or other glucosaminidase mRNAs, antisense 
nucleic acids should be at least six nucleotides in length, and are preferably 
oligonucleotides ranging from 6 to about 50 nucleotides in length. In specific aspects 
the oligonucleotide is at least 10 nucleotides, at least 17 nucleotides, at least 25 . 
nucleotides or at least 50 nucleotides. 
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[0192] The polynucleotides of the invention can be DNA or RNA or chimeric 
mixtures or derivatives or modified versions thereof, single-stranded or double- 
stranded. The oligonucleotide can be modified at the base moiety, sugar moiety, or 
phosphate backbone, for example, to improve stability of the molecule, hybridization, 
etc. The oligonucleotide may include other appended groups such as peptides (e.g., for 
targeting host cell receptors in vivo), agents facilitating transport across the cell 
membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553- 
6556; Lemaitre et al., Proc. Natl. Acad. Sci. 84:648-652 (1987); PCT Publication No. 
WO88/09810, published December 15, 1988), or hybridization-triggered cleavage 
agents (See, e.g., Krol et al., BioTechniques 6:958-976 (1988)) or intercalating agents. 
(See, e.g., Zon, Pharm. Res. 5:539-549 (1988)). To this end, the oligonucleotide may 
be conjugated to another molecule, e.g., a peptide, hybridization triggered cross-linking 
agent, transport agent, hybridization-triggered cleavage agent, etc. 
[0193] The antisense oligonucleotide may comprise at least one modified base 
moiety which is selected from the group including, but not limited to, 5-fluorouracil, 5- 
bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- 
carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2- 
methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 
methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl- 
2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 
[0194] The antisense oligonucleotide may also comprise at least one modified sugar 
moiety selected from the group including, but not limited to, arabinose, 
2-fluoroarabinose, xylulose, and hexose. 

[0195] In yet another embodiment, the antisense oligonucleotide comprises at least 
one modified phosphate backbone selected from the group including, but not limited to, 
a phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a phosphoramidate, 
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a phosphordiamidate, a methylphosphonate, an alkyl phosphotriester, and a formacetal 
or analog thereof. 

[0196] In yet another embodiment, the antisense oligonucleotide is an 
alpha-anomeric oligonucleotide. An alpha -anomeric oligonucleotide forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual 
beta-units, the strands run parallel to each other (Gautier et al., Nucl. Acids Res. 
15:6625-6641 (1987)). The oligonucleotide is a 2-0-methylribonucleotide (Inoue et al., 
Nucl. Acids Res. 15:6131-6148 (1987)), or a chimeric RNA-DNA analogue (Inoue et 
al., FEBS Lett 215:327-330 (1997)). 

[0197] Polynucleotides of the invention may be synthesized by standard methods 
known in the art, e.g. by use of an automated DNA synthesizer (such as are 
commercially available from Biosearch, Applied Biosystems, etc.). As examples, 
phosphorothioate oligonucleotides may be synthesized by the method of Stein et al. 
(Nucl. Acids Res. 16:3209 (1988)), methylphosphonate oligonucleotides can be 
prepared by use of controlled pore glass polymer supports (Sarin et al., Proc. Natl. 
Acad. Sci. U.S.A. 85:7448-7451 (1988)), etc. 

[0198] While antisense nucleotides complementary to the N-acetylglucosaminidase 
and/or other glucosaminidase coding region sequences could be used, those 
complementary to the transcribed untranslated region are most preferred. 
[0199] Potential N-acetylglucosaminidase or other glucosaminidase activity 
suppressors according to the invention also include catalytic RNA, or a ribozyme (See, 
e.g., PCT International Publication WO 90/1 1364, published October 4, 1990; Sarver et 
al, Science 247:1222-1225 (1990). While ribozymes that cleave mRNA at site specific 
recognition sequences can be used to destroy N-acetylglucosaminidase and/or other 
glucosaminidase mRNAs, the use of hammerhead ribozymes is preferred. 
Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that 
form complementary base pairs with the target mRNA. The sole requirement is that 
the target mRNA have the following sequence of two bases: 5 f -UG-3\ The 
construction and production of hammerhead ribozymes is well known in the art and is 
described more fully in Haseloff and Gerlach, Nature 334:585-591 (1988). Preferably, 
the ribozyme is engineered so that the cleavage recognition site is located near the 5* 
end of the N-acetylglucosaminidase and/or other glucosaminidase mRNAs; i.e., to 




increase efficiency and minimize the intracellular accumulation of non- functional 
mRNA transcripts. 

[0200] As in the antisense approach, the ribozymes of the invention can be 
composed of modified oligonucleotides (e.g. for improved stability, targeting, etc.) and 
should be delivered to cells which express N-acetylglucosaminidase and/or other 
glucosaminidases in vivo. DNA constructs encoding the ribozyme may be introduced 
into the cell in the same manner as described above for the introduction of antisense 
encoding DNA. A preferred method of delivery involves using a DNA construct 
"encoding" the ribozyme under the control of a strong constitutive promoter, such as, 
for example, pol m or pol H promoter, so that transfected cells will produce sufficient 
quantities of the ribozyme to destroy endogenous N-acetylglucosaminidase and/or other 
glucosaminidase messages and inhibit translation. Since ribozymes unlike antisense 
molecules, are catalytic, a lower intracellular concentration is required for efficiency. 
[0201] Endogenous gene expression can also be reduced by inactivating or 
"knocking out" the N-acetylglucosaminidase and/or other glucosaminidase gene and/or 
its promoter using targeted homologous recombination. {E.g., see Smithies et al., 
Nature 317:230-234 (1985); Thomas & Capecchi, Cell 51:503-512 (1987); Thompson 
et al., Cell 5:313-321 (1989); each of which is incorporated by reference herein in its 
entirety). For example, a mutant, non- functional polynucleotide of the invention, or a 
completely unrelated DNA sequence (such as for example, a sialic acid synthetase) 
flanked by DNA homologous to the endogenous polynucleotide sequence (either the 
coding regions or regulatory regions of the gene) can be used, with or without a 
selectable marker and/or a negative selectable marker, to transfect cells that express 
polypeptides of the invention in vivo. In another embodiment, techniques known in the 
art are used to generate knockouts in cells that contain, but do not express the gene of 
interest. Insertion of the DNA construct, via targeted homologous recombination, 
results in inactivation of the targeted gene. Such approaches are particularly suited in 
research and agricultural fields where modifications to embryonic stem cells can be 
used to generate animal offspring with an inactive targeted gene (e.g., see Thomas & 
Capecchi 1987 and Thompson 1989, supra). The contents of each of the documents 
recited in this paragraph is herein incorporated by reference in its entirety. 
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[0202] The use of chemical inhibitors is also within the scope of the present 
invention, in addition to, or as an alternative to, the antisense approach, and/or the 
ribozyme approach, and/or the gene "knock-out" approach, as means for suppressing 
glucosaminidase activity in insect cell cultures. Chemical inhibitors that may be used 
to suppress glucosaminidase activity include, but are not limited to, 2-acetamido- 1,2,5- 
trideoxy-1,5 amino-D-glucitol can limit the N-acetylglucosaminidase activity in insect 
cells (Legler et al (1991) Biochim. Biophys. Acta 1080:80-95, Wagner et al (1996) J. 
Virology 70:4103-4109). In addition, a number of other N-acetylglucosaminidase 
inhibitors may also be used according to the present invention, including, but not 
limited to, nagastatin (with a Ki value in the 10" 8 range) and GlcNAc-oxime (Ki in 0.45- 
22 mM) which are commercially, publicly, or otherwise available for the purposes of 
the present invention (Nishimura et al (1996) Bioorg. Med. Chem. 4:91-96, Aoyagi et 
al. (1992)/. Antibiotics 45:1404-1408). 

[0203] The chemical inhibitors mentioned above do not distinguish between 
lysosomal N-acetylglucosaminidase and the target membrane-bound N- 
acetylglucosaminidase activity in the secretory compartment. Thus, a more specific 
inhibitor, based on the substrate structure, is provided to serve not merely as a 
competitive inhibitor, but also as an affinity labeling reagent. The chemical structure 
for two possible chemical compounds with specificity for inhibiting membrane-bound 
glucosaminidase one or both of which may be used according to the present invention, 
are shown in Figure 19. Subsequent to expression and purification of the N- 
acetylglucosaminidase, the effectiveness of these inhibitors may be tested and 
compared in in vitro and/or in vivo trials using techniques described herein or otherwise 
known in the art. As above, these chemical inhibitors are then used in addition to, or as 
an alternative to, antisense suppression, ribozyme suppression, and/or gene knock-out 
mutagenesis, of glucosaminidase activity in insect cells. 

[0204] It is recognized that the suppression of glucosaminidase activity alone may 
not lead to production of the desired acceptor carbohydrate, if the enzymes responsible 
for generating structures terminating in Gal are lacking in particular cell lines. Thus, 
according to the methods of the present invention, Gal T activity in insect cells can be 
increased significantly by using techniques described described herein or otherwise 
known in the art to express a heterologous gene using a baculovirus construct 
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containing nucleic acid sequences encoding Gal T or a fragment or variant thereof, or 
by stably transforming the cells with a gene coding for Gal T or a fragment or variant 
thereof. If N-glycan analysis indicates that lower than a desired level of the acceptor 
substrates are present even following glucosaminidase suppression, techniques 
described herein or otherwise known in the art may be applied to express 
glycosyltransferase enzymes as needed in insect cells to produce a larger fraction of the 
desired acceptor structures. Figure 20 depicts that the overexpression of various 
glycosyltransferases leads to greater production of acceptor substrates. 
[0205] Alternatively, the expression of glycosyltransferases will serve to limit 
generation of paucimannosidic structures by generating unacceptable glucosaminidase 
substrates terminating in Gal, or by competing against the glucosaminidase reaction 
(Wagner et al, Glycobiology 6:165-175 (1996)). 

[0206] Thus, the invention comprises expression of glycosyltransferases combined 
with, or as an alternative to, suppression of N-acetylglucosaminidase activity in selected 
insect cell lines to produce desired quantities of carbohydrates containing the correct 
Gal (G) acceptor substrate for sialylation. Figure 21 illustrates, without limitation, 
three examples of acceptor N-glycan structures that comprise the terminal Gal acceptor 
residue required for subsequent sialylation. Other desired carbohydrates structures with 
a branch terminating Gal are also possible and are encompassed by the invention. 
[0207] Baculovirus expression vectors containing the coding sequence for GlcNAc - 
TI and -Til, and Gal T or fragments or variants thereof, and stable transfectants 
overexpressing GlcNAc-TI and GlcNAc-TH, and Gal T, or fragments or variants 
thereof are known, can be routinely generated using techniques known in the art, and 
are commercially, publicly, or otherwise available for the purposes of this invention. 
(See Jarvis et al (1996) Nature Biotech. 14:1288-1292; Hollister et al (1998) 
Glycobiology 8: 473-480; the contents of which are herein incorporated by reference). 
In addition, stable transfectants expressing GlcNAc-TI and GlcNAc-TH can be 
routinely generated using techniques known in the art, if overexpression proves 
desirable. 

Production and delivery of the Donor Substrate: CMP-Sialic Acid (CMP-SA) 
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[0208] For production of the donor substrate, CMP-SA, the invention provides 
methods and compositions comprising expression of limiting enzymes in the CMP-SA 
production pathway; in addition, or as an alternative to, the feeding of precursor 
substrates. 

[0209] To produce sialylated N-linked glycoproteins, the donor substrate, CMP- 
sialic acid (CMP-SA), must be synthesized. The structure of CMP-SA is shown in 
Figure 22. CMP-SA can be enzymatically synthesized from glucose or other simple 
sugars, glutamine, and nucleotides in mammalian cells and E. coli using the metabolic 
pathways shown in Figure 5, and as described in Ferwerda et al (1983) Biochem, J. 
27(5:87-92; Mahmoudian et al (1997) Enzyme and Microbial Technology 20:393-400; 
Schachter et al (1973) Metabolic Conjugation and Metabolic Hydrolysis (New York 
Academic Press) 2-135. 

[0210] In some mammalian tissues and cell lines, the production and delivery of 
CMP-SA limits the sialylation capacity of these cells (Gu et al. (1997) Improvement of 
the interferon-gamma sialylation in Chinese hamster ovary cell culture by feeding N- 
acetylmannosamine). This problem is likely to be amplified in insect cells since 
negligible sialic acid levels are detected in Trichoplusia ni insect cells as compared to 
levels in Chinese Hamster Ovary (CHO) mammalian cells (Figure 16). Furthermore, 
negligible CMP-SA was observed in Sf-9 and Ea-4 insect cells when compared to CHO 
cells (Hooker et al (1997) Monitoring the Glycosylation Pathway of Recombinant 
Human Interferon-Gamma Produced by Animal Cells, European Workshop on Animal 
Cell Engineering, Costa Brava, Spain; and Jenkins (1998) Restructuring the 
Carbohydrates of Recombinant Glycoproteins, Cell Culture Engineering VI, San 
Diego, CA). These findings are relevant in light of the previously published 
observation that polysialic acid can be detected in Drosophila embryos (Roth et al 
(1992) Science 256:673-675) and the observation of sialylated glycoproteins produced 
by other insect cells (Davidson et al (1990) Biochemistry 2P:5584-5590). 
[02 1 1 ] Production of sialic acid (S A), more specifically N-acetylneuraminic acid 
(NeuAc), from the precursor substrate ManNAc can proceed through three alternative 
pathways shown in Figure 5. The principal pathway for the production of SA in E. coli 
and other bacteria utilizes the phosphoenylpyruvate (PEP) and ManNAc to produce 
sialic acids in the presence of sialic acid synthetase (Vann et al (1997) Glycobiology 
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7:697-701). A second pathway, observed in bacteria and mammals, involves the 
reversible conversion by aldolase (also named N-acetylneuraminate lyase) of ManNAc 
and pyruvate to sialic acid (Schachter et al. (1973) Metabolic Conjugation and 
metabolic Hydrolysis (New York Academic Press) 2-135, Lilley et al. (1992) Prot. 
Expr. and Pur. 3:434-440). The aldolation reaction equilibrates toward ManNAc but 
can be manipulated to favor the production of sialic acid by the addition of excess 
ManNAc or pyruvate in vitro (Mahmoudian et al. (1997) Enzyme and Microbial 
Technology 20:393-400). The third pathway, observed only in mammalian tissue, 
begins with the ATP driven phosphorylation of ManNAc, and is followed by the 
enzymatic conversion of phosphorylated ManNAc to a phosphorylated form of sialic 
acid, from which the phosphate is removed in a subsequent step (van Rinsum et al. 
(1983) Biochem. J. 210:21-28, Schachter et al. (1973) Metabolic Conjugation and 
metabolic Hydrolysis (New York Academic Press) 2-135). 

[02 1 2] According to one embodiment of the invention, to overcome intracellular 
limitations of CMP-SA in mammalian cells, feeding of alternative precursor substrates 
may be applied to eliminate or reduce the need to produce CMP-SA from simple sugars 
(see Example 6). Since CMP-SA and its direct precursor, SA, are not permeable to cell 
membranes (Bennetts et al (1981) J. Cell. Biol. 88:1-15), these substrates cannot be 
added to the culture medium for uptake by the cell. However, other precursors, 
including N-acetylmannosamine (ManNAc), glucosamine, and N-acetylglucosamine 
(GlcNAc) when added to the culture medium are absorbed into mammalian cells (see 
Example 6). See, for example, Gu et al. (1997) Improvement of the interferon-gamma 
sialylation in Chinese hamster ovary cell culture by feeding N-acetylmannosamine, 
Zanghi et al. (1997) European Workshop on Animal Cell Engineering, Ferwerda et al. 
(1983) Biochem. J. 216:87-92, Kohn et al. (1962) J. Biol. Chem. 237:304-308, Thomas 
et al. (1985) Biochim. Biophys. Acta 846:37-43, Bennetts et al. (1981) J. Cell. Biol. 
88:1-15. The substrates are then enzymatically converted to CMP-SA and incorporated 
into homologous and heterologous glycoproteins {Gu et al. (1997) Improvement of the 
interferon-gamma sialylation in Chinese hamster ovary cell culture by feeding N- 
acetylmannosamine, Ferwerda et al. (1983) Biochem. J. 216:87-92, Kohn et al. (1962) 
J. Biol. Chem. 237:304-308, Bennetts et al. (1981) J. Cell. Biol. 88:1-15). 
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[0213J To be incorporated into oligosaccharides, sialic acid and cytidine 
triphosphate (CTP) must be converted to CMP-SA by the enzyme, CMP-sialic acid 
(CMP-SA) synthetase (Schachter et al. (1973) Metabolic Conjugation and metabolic 
Hydrolysis (New York Academic Press) 2-135): 



Sialic Acid + CTP -»CMP-SA + PPi 



[0214] This enzyme has been cloned and sequenced from E. coli and used for the in 
vitro production of CMP-SA, as described in Zapata et al. (1989) J. Biol. Chem. 
264:147 '69-14774, Kittleman et al. (1995) Appl. Microbiol. Biotechnol. 44:59-67, 
Ichikawae/a/. (1992) Anal. Biochem. 202:215-238, Shames et al. (1991) Glycobiology 
1:187-191; the contents of which are herein incorporated by reference). 
[0215] In eukaryotes, the activated sugar nucleotide, CMP-SA, must be transported 
into the Golgi lumen for sialylation to proceed (Deutscher et al. (1984) Cell 39:295- 
299). Transport through the trans-Golgi membrane is facilitated by the CMP-SA 
transporter protein, which was identified by complementation cloning into sialylation 
deficient CHO cells (Eckhardt et al. (1996) Proc. Natl. Acad. Sci. USA 93:7572-7576). 
This mammalian gene has also been cloned and expressed in a functional form in the 
heterologous host, S. cerevisiae (Bernisone et al. (1997) J. Biol. Chem. 272:12616- 
12619). 

[021 6] In addition to feeding of external precursor substrates such as ManNAc, 
GlcNAc, or glucosamine to increase CMP-SA levels, a supplementary approach in 
which CMP-SA transporter genes are introduced and expressed using routine 
recombinant DNA techniques may also be employed according to the methods of the 
present invention. These techniques are optionally combined with ManNAc, GlcNAc, 
or glucosamine feeding strategies described above, to maximize CMP-SA production. 

Conversion of GlcNAc or glucosamine to ManNAc 

[0217] Also according to the methods of the present invention, where the utilization 
of GlcNAc or glucosamine is preferred and ManNAc is not generated naturally in insect 
cells, ManNAc can be produced chemically using sodium hydroxide (Mahmoudian et 
al. (1997) Enzyme and Microbial Technology 20:393-400). Alternatively, the enzymes 
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that convert these substrates to ManNAc or fragments or variants of these enzymes, can 
be expressed in insect cells using techniques described herein or otherwise known in 
the art. The production of ManNAc from GlcNAc and glucosamine proceeds through 
the metabolic pathway shown in Figure 23. 

[0218] Two approaches are provided to accomplish this conversion: (a) direct 
epimerization of GlcNAc; or (b) conversion of GlcNAc or glucosamine to UDP-N- 
acetylglucosamine (UDP-GlcNAc), and then ManNAc. According to one embodiment 
of the invention, approach (a) is achieved using the gene encoding a GlcNAc-2- 
epimerase isolated from pig kidney, or fragments or variants thereof, to directly convert 
GlcNAc to ManNAc (See Maru et al. (1996) J. Biol. Chem. 271:16294-16299; the 
contents of which are herein incorporated by reference). Additionally, the sequence for 
a homologue of this enzyme can be routinely obtained from bioinformatics databases, 
and cloned into baculovirus vectors, or stably integrated into insect cells using 
techniques described herein or otherwise known in the art. 

[0219] Alternatively, approach (b) requires insertion of the gene to convert UDP- 
GlcNAc to ManNAc. Engineering the production of UDP-GlcNAc from glucosamine 
or GlcNAc is likely not required since most insect cells comprise metabolic pathways 
to synthesize UDP-GlcNAc; as indicated by the presence of GlcNAc-containing 
oligosaccharides. According to one embodiment of the invention, the gene encoding a 
rat bifunctional enzyme coding for conversion of UDP-GlcNAc to ManNAc and 
ManNAc to ManNAc-6-P, or fragments or variants thereof is used to engineer the 
production of UDP-GlcNAc using techniques described herein or otherwise known in 
the art (Stasche et al. (1997) J. Biol. Chem. 272:24319-24324, the contents which are 
herein incorporated by reference). In a specific embodiment, the segment of this 
enzyme responsible for conversion of UDP-GlNAc to ManNAc may be expressed 
independently in insect cells using techniques known in the art to produce ManNAc 
rather than ManNAc-6-P. 

Conversion of ManNAc to SA 

[0220] Once ManNAc is generated, it is converted to S A according to the methods 
of the invention. There are three possible metabolic pathways for the conversion of 
ManNAc to SA in bacteria and mammals, as shown in Figure 24. Negligible SA levels 
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have previously been observed in insect cells (in the absence of exogenous 
supplementation of ManNAc to the culture media). 

[022 1 ] The conversion of ManNAc and PEP to S A using sialic acid synthetase is the 
predominant pathway for SA production in E. coli (Vann et al. (1997) Glycobiology 
7:697-701). The E. coli sialic acid (SA) synthetase gene NeuB (SEQ ID NO:7 and 8) 
has been cloned and sequenced and is commercially, publicly, and/or otherwise 
available for the purposes of the present invention. Additionally, as disclosed herein, 
the human sialic acid synthetase gene has also been cloned (cDNA clone HA5AA37), 
sequenced, and deposited with the American Type Culture Collection ("ATCC") on 
February 24, 2000 and was given the ATCC Deposit Number PTA-1410. (The ATCC 
is located at 10801 University Boulevard, Manassas, VA 201 10-2209, USA. ATCC 
deposits were made pursuant to the terms of the Budapest Treaty on the international 
recognition of the deposit of microorganisms for purposes of patent procedure.) Thus, 
for enhancing expression of SA synthetase according to certain embodiments of the 
invention, the nucleic acid compositions encoding a SA synthetase such as, for 
example, an E.coli and/or human sialic acid synthetase and/or a fragment or variant 
thereof, may be inserted into a host expression vector or into the host genome using 
techniques described herein or otherwise known in the art. According to the methods 
of the invention, the production of S A can also be achieved from ManNAc and 
pyruvate using an aldolase, such as, for example, bacterial aldolase (Mahmoudian et al. 
(1997) Enzyme and Microbial Technology 20:393-400), or a human aldolase (as 
described herein) or fragment or variant thereof. The human aldolase gene has been 
cloned (cDNA clone HDPAK85), sequenced, and deposited with the American Type 
Culture Collection ("ATCC") on February 24, 2000 and was given the ATCC Deposit 
Number PTA-1410. Thus, the aldolase enzyme is considered as an alternative for 
converting ManNAc to S A. For enhancing expression of aldolase, the aldolase 
sequences can be amplified directly from E. coli and human DNA using primers and 
PCR amplification as described in Mahmoudian et al. (Mahmoudian et al. (1997) 
Enzyme and Microbial Technology 20:393-400); the contents of which are herein 
incorporated by reference) and herein, and using techniques described herein or 
otherwise known in the art to enhance expression of aldolase, or a fragment or variant 
thereof. Since the aldolase reaction is reversible, high levels of added ManNAc and 
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pyruvate, may be used according to the methods of the invention to drive this reversible 
reaction in the direction of the product SA (Mahmoudian et al. (1997) Enzyme and 
Microbial Technology 20:393-400). 

[0222] In addition to the pathways which convert ManNAc to S A present in both 
prokaryotes and eukaryotes, an exclusively eukaryotic pathway may also employed 
according to the methods of the invention to convert ManNAc to SA through the 
phosphate intermediates ManNAc-6-phosphate and SA-9-phosphate. It is recognized 
that the mammalian enzymes (synthetase and phosphatase) responsible for converting 
ManNAc to SA through phosphate intermediates can be utilized for engineering this 
eukaryotic pathway into insect cells. 

Conversion ofSA to CMP-SA 

[0223] The methods of the invention also encompass the use of CMP-SA synthetase 
to enzymatically converts SA to CMP-SA (see, e.g., the reaction shown in Figure 25). 
However, insect cells, such as, for example, Sf9 insect cells, have negligible 
endogenous CMP-SA synthetase activity. Evidence of limited CMP-SA synthetase in 
insect cells is also demonstrated by increased S A levels found following substrate 
feeding and genetic manipulation without a concomitant increase in CMP-SA. 
[0224] Thus, specific embodiments of the invention provide methods for enhancing 
the expression of CMP-SA synthetase, and/or fragments or variants thereof. Bacterial 
CMP-SA synthetase has been cloned and sequenced as described in Zapata et al. (1989) 
J. Biol. Chem. 264:14769-14774; the contents of which are herein incorporated by 
reference. Additionally, as described herein the gene encoding human CMP-SA 
synthetase has also been cloned (cDNA clone HWLLM34), sequenced and deposited 
with the American Type Culture Collection ("ATCC") on February 24, 2000 and was 
given the ATCC Deposit Number PTA-1410. Thus, in specific embodiments, the 
methods of the present invention provide for enhancing expression of bacterial or 
human CMP-SA synthetase or fragments, or variants thereof, in cells of interest, such 
as, for example, in insect cells, using techniques described herein, or otherwise known 
in the art. 
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Golgi transport of CMP-SA 

[0225] CMP-SA must be delivered into the Golgi apparatus in order for sialylation 
to occur, and this transport process depends on the presence of the CMP-SA transporter 
protein (Deutscher et al. (1984) Cell 39:295-299). To determine if CMP-SA 
synthesized in insect cells is efficiently transported into the proper cellular 
compartment, insect cell vesicles are prepared and transport of CMP-SA is measured as 
described in (Bernisone et al. {\991)J. Biol. Chem. 272:12616-12619) and/or using 
techniques otherwise known in the art. Where the native enzymatic transport is lower 
than desired, a transporter enzyme is cloned and expressed in insect cells using the 
known mammalian gene sequence (as described in Bernisone et al. (1997) J. Biol. 
Chem. 272:12616-12619, Eckhardt et al. (1996) Proc. Natl. Acad. Sci. USA 93:7572- 
7576; the contents of which are herein incorporated by reference) and/or sequences 
otherwise known in the art. Corresponding sequences are available from bioinformatics 
databases for the purposes of this invention. Localization of the protein to the Golgi is 
evaluated using an antibody generated against the heterologous protein using 
techniques known in the art in concert with commercially available fluorescent probes 
that identify the Golgi apparatus. 

[0226] Expression cloning of multiple transcripts (for example, transcripts encoding 
CMP-SA pathway enzymes, glycosyl transferases, and ribozymes or anti-sense RNAs 
to suppress hexosaminidases) in a single cell line using techniques known in the art 
may be required to bring about the desired sialylation reactions and/or to optimize these 
reactions. Alternatively, co-infection of cells with multiple viruses using techniques 
known in the art can also be used to simultaneously produce multiple recombinant 
transcripts. In addition, plasmids that incorporate multiple foreign genes including 
some under the control of the early promoter ffil are commercially, publicly, or 
otherwise available for the purposes of the invention, and can be used to create 
baculovirus constructs. The present invention encompasses using any of these 
techniques. The invention also encompasses using the above mentioned types of 
vectors to enable expression of desired carbohydrate processing enzymes in baculovirus 
infected insect cells prior to production of a heterologous glycoprotein of interest under 
control of the very late polyhedrin promoter. In this manner, once the desired 
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polypeptide is synthesized essential N-glycan processing enzymes can facilitate N- 
glycan processing once the glycoprotein of interest. 

[0227] Alternatively, genes for some of the enzymes may be incorporated directly 
into the insect cell genome using vectors known in the art, such as, for example, vectors 
similar to those described in (Jarvis et al. (1990) Bio/Technology 8:950-955, Jarvis et 
al. (1995) Baculovirus Expr. Protocols ed. 39:187-202). Genomic integration 
eliminates the need to infect the cells with a large number of viral constructs. These 
constructs for genomic integration contain one or more early viral promoters, including 
AcA/NPV IE1 and 39K, which provide constitutive expression in transfected insect 
cells (Jarvis et al. (1990) Bio/Technology 8:950-955). In addition, a sequential 
transformation strategy may routinely be developed for producing stable transformants 
that constitutively express up to four different heterologous genes simultaneously. 
These vectors and transformation techniques are provided for the purposes of this 
invention. In this manner, incorporation of plasmids containing heterologous genes 
into the insect cell genome combined with baculovirus infection integrates the 
metabolic pathways leading to efficient acceptor and donor substrate production in 
insect cells. 



Generation of N-linked sialvlated glycoproteins 

[0228] The final step in the generation of sialylated glycoproteins or glycolipids in 
mammalian cells is the enzymatic transfer of sialic acid from the donor substrate, CMP- 
SA, onto an acceptor substrate in the Golgi apparatus; a reaction which is catalyzed by 
sialyltransferase. The sialic acid (SA) residues occurring in N-linked glycoproteins are 
a/p/ia-linked to the 3 or 6 position of the GalGlcNAc sugars (Tsuji, S. (1996) J. 
Biochem. 120:1-13). The SA a /p/ia2-3GalGlcNAc linkage is found in heterologous 
glycoproteins expressed by CHO and human cells and the SA a//?/ia2-6GalGlcNAc 
linkage is found in many human glycoproteins (Goochee et al. (1991) Bio/technology 
9:1347-1355). The alpha2-3- and/or a/p/ja2-6-sialyltransferase genes along with a 
number of other sialyltransferase genes have been cloned, sequenced and expressed as 
active heterologous proteins as described in Lee et al. (1989) J. Biol. Chem. 264:13848- 
13855, Ichikawa et al. (1992) Anal. Biochem. 202:215-238, Tsuji, S. (1996) J. 
Biochem. 120:1-13; U.S. Patent No. 5,047,335, the contents of which are herein 
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incorporated by reference. Any one or more of these genes, as well as fragments, 
and/or variants thereof may be introduced and expressed in cells of interest using 
techniques described herein or otherwise known in the art, and may be used according 
to the methods of the present invention to enhance the enzymatic transfer of sialic acid 
from the donor substrate. 

[0229] For generating N-Linked sialylated glycoproteins in insect cells, once the 
donor (CMP-SA) and acceptor (GalGlcNAc-R) substrates are produced as described 
above, the methods of the invention further comprise expression of a sialyltransferase 
or fragment or variant thereof, in the cells. The completion of the sialylation reaction 
can be verified by elucidating the N-glycan structures attached to a desired glycoprotein 
using techniques described herein or otherwise known in the art. It is recognized that 
evaluation of N-glycans attachments may also suggest additional metabolic engineering 
strategies that can further enhance the level of sialylation in insect cells. 
[0230] It is observed that unmodified T. ni insect cell lysates failed to generate any 
sialylated compounds when incubated with the substrate, LacMU, and the nucleotide 
sugar, CMP-SA- Thus, it is concluded that these cells comprise negligible native 
sialyltransferase activity. However, infection of insect cells with a baculovirus 
containing alpha2,3 sialyltransferase provided significant enzymatic conversion of 
LacMU and CMP-SA to sialylLacMU. For the purposes of the invention, heterologous 
sialyltransferase can be expressed using techniques described herein or otherwise 
known in the art either by co-infection with a virus coding for sialyltransferase, or 
fragment, or variant thereof, or by using stable transfectants expressing the enzyme. In 
addition to the 2,3 sialyltransferase baculovirus constructs, baculovirus vectors 
comprising sequences coding for alpha2,6 sialyltransferase and/or fragments or variants 
thereof as well as stably transformed insect cells stably expressing both gal T and 
sialyltransferase are commercially, or publicly available, and/or may routinely be 
generated using techniques described herein or otherwise known in the art. Evaluation 
of sialyltransferase activity is determined using the FRET or HPLC assays described 
herein and/or using other assays known in the art. Localization of the sialyltransferase 
to the Golgi is accomplished using anti-sialyltransferase antibodies commercially, 
publicly, or otherwise available for the purpose of this invention in concert with Golgi 
specific marker proteins. 
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[023 1 ] For the purposes of enhancing carbohydrate processing enzymes of the 
invention, suppressing activity of endogenous N-acetylglucosaminidase, expressing 
heterologous proteins in the cells of the invention, and constructing vectors for the 
purposes of the invention; genetic engineering methods are known to those of ordinary 
skill in the art. For example, see Schneider, A. et ai, (1998) Mol. Gen. Genet. 
257:308-318. Where the invention encompasses utilizing baculovirus based 
expression, such methods are known in the art, for example, as described in O'Riley et 
al. (1992) Baculovirus Expression Vectors, W.H. Freeman and Company, New York 
1992. 

[0232] For the purposes of enhancing carbohydrate processing enzymes of the 
invention, suppressing activity of endogenous N-acetylglucosaminidase, expressing 
heterologous proteins in the cells of the invention, and constructing vectors as described 
herein, known sequences can be utilized in the methods of the invention, including but 
not limited to the sequences described in GenSeq accession No. Zl 1234 and Zl 1235 
for two human galactosyltransferases (see also United States Patent Number 5,955,282; 
the contents of which are herein incorporated by reference); and/or in Genbank 
accession No. D83766 for GlcNAc-2-epimerase, Y07744 for the Afunctional rate liver 
enzyme capable of catalyzing conversion of UDP-GlcNAc to ManNAc, J05023 for E. 
coli CMP-SA synthetase, AJ006215 for murine CMP-SA synthetase, Z71268 for 
murine CMP-SA transporter, X03345 for E. coli aldolase, U05248 for E. coli SA 
synthetase, XI 7247 for human 2,6 sialyltransferase, L29553 for human 2,3 
sialyltransferase, Ml 32 14 for bovine galactosyltransferase, L77081 for human GlcNAc 
T-I, U15128 or L36537 for human GlcNAc T-H, D87969 for human CMP-SA 
transporter, and S95936 for human transferrin; and fragments or variants of the 
enzymes that display one or more of the biological activities of the enzymes (such 
biological activities may routinely be assayed using techniques described herein or 
otherwise known in the art). The sequences described above are readily accessible 
using the provided accession number in the NCBI Entrez database, known to the person 
of ordinary skill in the art. 

[02331 Thus, one aspect of the invention provides for use of isolated nucleic acid 
molecules comprising polynucleotides having nucleotide sequences selected from the 
group consisting of : (a) nucleotide sequences encoding a biologically active fragment 




or variant of the polypeptide having the amino acid sequence described in GenSeq 
accession No. Zl 1234 and Zl 1235 for two human galactosyltransferases; and/or in 
Genbank accession No. D83766 for GlcNAc-2-epimerase, Y07744 for the Afunctional 
rate liver enzyme capable of catalyzing conversion of UDP-GlcNAc to ManNAc, 
J05023 for E. coli CMP-SA synthetase, AJ006215 for murine CMP-SA synthetase, 
Z71268 for murine CMP-SA transporter, X03345 for E. coli aldolase, U05248 for E. 
coli SA synthetase, XI 7247 for human 2,6 sialyltransferase, L29553 for human 2,3 
sialyltransferase, Ml 32 14 for bovine galactosyltransferase, L77081 for human GlcNAc 
T-I, U15128 or L36537 for human GlcNAc T-H, D87969 for human CMP-SA 
transporter, and/or S95936 for human transferrin; (b) nucleotide sequences encoding an 
antigenic fragment of the polypeptide having the amino acid sequence described in 
GenSeq accession No. Zl 1234 and Zl 1235 for two human galactosyltransferases (see 
also United States Patent Number 5,955,282; the contents of which are herein 
incorporated by reference); and/or in Genbank accession No. D83766 for GlcNAc-2- 
epimerase, Y07744 for the Afunctional rate liver enzyme capable of catalyzing 
conversion of UDP-GlcNAc to ManNAc, J05023 for E. coli CMP-SA synthetase, 
AJ006215 for murine CMP-SA synthetase, Z71268 for murine CMP-SA transporter, 
X03345 for E. coli aldolase, U05248 for E. coli SA synthetase, XI 7247 for human 2,6 
sialyltransferase, L29553 for human 2,3 sialyltransferase, M13214 for bovine 
galactosyltransferase, L77081 for human GlcNAc T-I, U15128 or L36537 for human 
GlcNAc T-H, D87969 for human CMP-SA transporter, and/or S95936 for human 
transferrin; and (c) nucleotide sequences complementary to any of the nucleotide 
sequences in (a) or (b), above. Polypeptides encoded by such nucleic acids may also be 
used according to the methods of the present invention. Further embodiments of the 
invention include use of isolated nucleic acid molecules that comprise a polynucleotide 
having a nucleotide sequence at least 80%, 85%, or 90% identical, and more preferably 
at least 95%, 97%, 98% or 99% identical, to any of the above nucleotide sequences, or 
a polynucleotide which hybridizes under stringent hybridization conditions to a 
polynucleotide that is complementary to any of the above nucleotide sequences. This 
polynucleotide which hybridizes does not hybridize under stringent hybridization 
conditions to a polynucleotide having a nucleotide sequence consisting of only A 
residues or of only T residues. Polypeptides encoded by such nucleic acids may also be 
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used according to the methods of the present invention. Preferably, the nucleic acid 
sequences (including fragments or variants) that may be used according to the methods 
of the present invention encode a polypeptide having a biological activity. Such 
biological activity may routinely be assayed using techniques described herein or 
otherwise known in the art. 

[02341 In addition to the sequences described above, the nucleotide sequences and 
[0235] amino acid sequences disclosed in Figures 27-32, and fragments and variants 
of these sequences may also be used according to the methods of the invention. 
[0236] In one embodiment, specific enzyme polypeptides comprise the amino acid 
sequences shown in Figures 28, 30 and 32; or otherwise described herein. However, 
the invention also encompasses sequence variants of the polypeptide sequences shown 
in Figures 28, 30 and 32. 

[0237] In a specific embodiment, one, two, three, four, five or more human 
polynucleotide sequences, or fragments, or variants thereof, and/or the polypeptides 
encoded thereby, are used according to the methods of the present invention to convert 
ManNAc to SA (see Example 6). Such polynucleotide and polypeptide sequences 
include, but are not limited to, sequences corresponding to human aldolase (SEQ ID 
NO:l and SEQ ID NO:2), human CMP-SA synthetase (SEQ ID NO:3 and SEQ ID 
NO:4), and human SA synthetase (SEQ ID NO:5 and SEQ ID NO:6); see also Figures 
27 - 32. Thus, in certain embodiments the methods of present invention include the use 
of one or more novel isolated nucleic acid molecules comprising polynucleotides 
encoding polypeptides important to intracellular carbohydrate processing in humans. 
Such polynucleotide sequences include those disclosed in the figures and/or Sequence 
Listing and/or encoded by the human cDNA plasmids (Human CMP-Sialic Acid 
Synthetase, cDNA clone HWLLM34; Human Sialic Acid Synthetase, cDNA clone 
HA5AA37; and Human Aldolase cDNA clone HDPAK85) deposited with the 
American Type Culture Collection (ATCC) on February 24, 2000 and receiving 
accession numbers PTA-1410. The present invention further includes the use of 
polypeptides encoded by these polynucleotides. The present invention also provides for 
use of isolated nucleic acid molecules encoding fragments and variants of these 
polypeptides, and for the polypeptides encoded by these nucleic acids. 




[0238] Thus, one aspect of the invention provides for use of isolated nucleic acid 
molecules comprising polynucleotides having nucleotide sequences selected from the 
group consisting of : (a) nucleotide sequences encoding human aldolase having the 
amino acid sequences as shown in SEQ ID NO:2; (b) nucleotide sequences encoding a 
biologically active fragment of the human aldolase polypeptide having the amino acid 
sequence shown in SEQ ID NO:2; (c) nucleotide sequences encoding an antigenic 
fragment of the human aldolase polypeptide having the amino acid sequence shown in 
SEQ ID NO:2; (d) nucleotide sequences encoding the human aldolase polypeptide 
comprising the complete amino acid sequence encoded by the plasmid contained in the 
ATCC Deposit; (e) nucleotide sequences encoding a biologically active fragment of the 
human aldolase polypeptide having the amino acid sequence encoded by the plasmid 
contained in the ATCC Deposit; (f) a nucleotide sequence encoding an antigenic 
fragment of the human aldolase polypeptide having the amino acid sequence encoded 
by the plasmid contained in the ATCC Deposit; and (g) nucleotide sequences 
complementary to any of the nucleotide sequences in (a) through (f), above. 
Polypeptides encoded by such nucleic acids may also be used according to the methods 
of the present invention. Further embodiments of the invention include use of isolated 
nucleic acid molecules that comprise a polynucleotide having a nucleotide sequence at 
least 80%, 85%, or 90% identical, and more preferably at least 95%, 97%, 98% or 99% 
identical, to any of the nucleotide sequences in (a), (b), (c), (d), (e), (f), or (g), above, or 
a polynucleotide which hybridizes under stringent hybridization conditions to a 
polynucleotide in (a), (b), (c), (d), (e), (f), or (g), above. This polynucleotide which 
hybridizes does not hybridize under stringent hybridization conditions to a 
polynucleotide having a nucleotide sequence consisting of only A residues or of only T 
residues. Polypeptides encoded by such nucleic acids may also be used according to the 
methods of the present invention. 

[0239] Another aspect of the invention provides for use of isolated nucleic acid 
molecules comprising polynucleotides having nucleotide sequences selected from the 
group consisting of : (a) nucleotide sequences encoding human CMP-SA synthetase 
having the amino acid sequences as shown in SEQ ID NO:4; (b) nucleotide sequences 
encoding a biologically active fragment of human CMP-SA synthetase polypeptide 
having the amino acid sequence shown in SEQ ID NO:4; (c) nucleotide sequences 
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encoding an antigenic fragment of the human CMP-SA synthetase polypeptide having 
the amino acid sequence shown in SEQ ED NO:4; (d) nucleotide sequences encoding 
the human CMP-SA synthetase polypeptide comprising the complete amino acid 
sequence encoded by the plasmid contained in the ATCC Deposit; (e) nucleotide 
sequences encoding a biologically active fragment of the human CMP-SA synthetase 
polypeptide having the amino acid sequence encoded by the plasmid contained in the 
ATCC Deposit; (f) a nucleotide sequence encoding an antigenic fragment of the human 
CMP-SA synthetase polypeptide having the amino acid sequence encoded by the 
plasmid contained in the ATCC Deposit; and (g) nucleotide sequences complementary 
to any of the nucleotide sequences in (a) through (f), above. Polypeptides encoded by 
such nucleic acids may also be used according to the methods of the present invention. 
Further embodiments of the invention include use of isolated nucleic acid molecules 
that comprise a polynucleotide having a nucleotide sequence at least 80%, 85%, or 90% 
identical, and more preferably at least 95%, 97%, 98% or 99% identical, to any of the 
nucleotide sequences in (a), (b), (c), (d), (e), (f), or (g) above, or a polynucleotide which 
hybridizes under stringent hybridization conditions to a polynucleotide in (a), (b), (c), 
(d), (e), (f), or (g), above. This polynucleotide which hybridizes does not hybridize 
under stringent hybridization conditions to a polynucleotide having a nucleotide 
sequence consisting of only A residues or of only T residues. Polypeptides encoded by 
such nucleic acids may also be used according to the methods of the present invention. 
[0240] Another aspect of the invention provides for use of isolated nucleic acid 
molecules comprising polynucleotides having nucleotide sequences selected from the 
group consisting of: (a) nucleotide sequences encoding human S A synthetase having 
the amino acid sequences as shown in SEQ ID NO:6; (b) nucleotide sequences 
encoding a biologically active fragment of the human SA synthetase polypeptide having 
the amino acid sequence shown in SEQ ID NO:6; (c) nucleotide sequences encoding 
an antigenic fragment of the human SA synthetase polypeptide having the amino acid 
sequence shown in SEQ ID NO:6; (d) nucleotide sequences encoding the human SA 
synthetase polypeptide comprising the complete amino acid sequence encoded by the 
plasmid contained in the ATCC Deposit; (e) nucleotide sequences encoding a 
biologically active fragment of the human SA synthetase polypeptide having the amino 
acid sequence encoded by the plasmid contained in the ATCC Deposit; (f) a nucleotide 
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sequence encoding an antigenic fragment of the human SA synthetase polypeptide 
having the amino acid sequence encoded by the plasmid contained in the ATCC 
Deposit; and (g) nucleotide sequences complementary to any of the nucleotide 
sequences in (a) through (f), above. Polypeptides encoded by such nucleic acids may 
also be used according to the methods of the present invention. Further embodiments 
of the invention include use of isolated nucleic acid molecules that comprise a 
polynucleotide having a nucleotide sequence at least 80%, 85%, or 90% identical, and 
more preferably at least 95%, 97%, 98% or 99% identical, to any of the nucleotide 
sequences in (a), (b), (c), (d), (e), (f), or (g) above, or a polynucleotide which hybridizes 
under stringent hybridization conditions to a polynucleotide in (a), (b), (c), (d), (e), (f), 
or (g), above. This polynucleotide which hybridizes does not hybridize under stringent 
hybridization conditions to a polynucleotide having a nucleotide sequence consisting of 
only A residues or of only T residues. Polypeptides encoded by such nucleic acids may 
also be used according to the methods of the present invention. 

[0241] By a nucleic acid having a nucleotide sequence at least, for example, 95% 
"identical" to a reference nucleotide sequence of the present invention, it is intended 
that the nucleotide sequence of the nucleic acid is identical to the reference sequence 
except that the nucleotide sequence may include up to five point mutations per each 
100 nucleotides of the reference nucleotide sequence encoding the described 
polypeptide. In other words, to obtain a nucleic acid having a nucleotide sequence at 
least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in 
the reference sequence may be deleted or substituted with another nucleotide, or a 
number of nucleotides up to 5% of the total nucleotides in the reference sequence may 
be inserted into the reference sequence. The query sequence may be an entire sequence, 
such as, for example, that shown of SEQ ED NO:l, the ORF (open reading frame), or 
any fragment as described herein. 

[0242] As a practical matter, whether any particular nucleic acid molecule or 
polypeptide is at least, for example, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% 
identical to a nucleotide sequence of the presence invention can be determined 
conventionally using known computer programs. A preferred method for determining 
the best overall match between a query sequence (a sequence of the present invention) 
and a subject sequence, also referred to as a global sequence alignment, can be 



43 



determined using the FASTDB computer program based on the algorithm of Brutlag et 
al. (Comp. App. Biosci. (1990) 6:237-245.) In a sequence alignment the query and 
subject sequences are both DNA sequences. An RNA sequence can be compared by 
converting U's to T's. The result of said global sequence alignment is in percent 
identity. Preferred parameters used in a FASTDB alignment of DNA sequences to 
calculate percent identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty=l, Joining 
Penalty=30, Randomization Group Length=0, Cutoff Score=l, Gap Penalty^, Gap 
Size Penalty 0.05, Window Size=500 or the length of the subject nucleotide sequence, 
whichever is shorter. 

[0243] If the subject sequence is shorter than the query sequence because of 5' or 3' 
deletions, not because of internal deletions, a manual correction must be made to the 
results. This is because the FASTDB program does not account for 5' and 3' 
truncations of the subject sequence when calculating percent identity. For subject 
sequences truncated at the 5' or 3' ends, relative to the query sequence, the percent 
identity is corrected by calculating the number of bases of the query sequence that are 
5' and 3' of the subject sequence, which are not matched/aligned, as a percent of the 
total bases of the query sequence. Whether a nucleotide is matched/aligned is 
determined by results of the FASTDB sequence alignment. This percentage is then 
subtracted from the percent identity, calculated by the above FASTDB program using 
the specified parameters, to arrive at a final percent identity score. This corrected score 
is what is used for the purposes of the present invention. Only bases outside the 5' and 
3' bases of the subject sequence, as displayed by the FASTDB alignment, which are not 
matched/aligned with the query sequence, are calculated for the purposes of manually 
adjusting the percent identity score. 

[0244] For example, a 90 base subject sequence is aligned to a 100 base query 
sequence to determine percent identity. The deletions occur at the 5' end of the subject 
sequence and therefore, the FASTDB alignment does not show a matched/alignment of 
the first 10 bases at 5' end. The 10 unpaired bases represent 10% of the sequence 
(number of bases at the 5' and 3' ends not matched/total number of bases in the query 
sequence) so 10% is subtracted from the percent identity score calculated by the 
FASTDB program. If the remaining 90 bases were perfectly matched the final percent 
identity would be 90%. In another example, a 90 base subject sequence is compared 
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with a 100 base query sequence. This time the deletions are internal deletions so that 
there are no bases on the 5' or 3' of the subject sequence which are not matched/aligned 
with the query. In this case the percent identity calculated by FASTDB is not manually 
corrected. Once again, only bases 5' and 3' of the subject sequence which are not 
matched/aligned with the query sequence are manually corrected for. No other manual 
corrections are to made for the purposes of the present invention. 
[0245] By a polypeptide having an amino acid sequence at least, for example, 95% 
"identical" to a query amino acid sequence of the present invention, it is intended that 
the amino acid sequence of the subject polypeptide is identical to the query sequence 
except that the subject polypeptide sequence may include up to five amino acid 
alterations per each 100 amino acids of the query amino acid sequence. In other words, 
to obtain a polypeptide having an amino acid sequence at least 95% identical to a query 
amino acid sequence, up to 5% of the amino acid residues in the subject sequence may 
be inserted, deleted (indels) or substituted with another amino acid. These alterations 
of the reference sequence may occur at the amino or carboxy terminal positions of the 
reference amino acid sequence or anywhere between those terminal positions, 
interspersed either individually among residues in the reference sequence or in one or 
more contiguous groups within the reference sequence. 

[0246] As a practical matter, whether any particular polypeptide is at least, for 
example, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to, for example, the 
amino acid sequences of SEQ ID NO:2 or to the amino acid sequence encoded by the 
cDNA contained in a deposited clone can be determined conventionally using known 
computer programs. A preferred method for determining the best overall match 
between a query sequence (a sequence of the present invention) and a subject sequence, 
also referred to as a global sequence alignment, can be determined using the FASTDB 
computer program based on the algorithm of Brutlag et al. (Comp. App. Biosci. 6:237- 
245(1990)). In a sequence alignment the query and subject sequences are either both 
nucleotide sequences or both amino acid sequences. The result of said global sequence 
alignment is in percent identity. Preferred parameters used in a FASTDB amino acid 
alignment are: Matrix=PAM 0, k-tuple=2, Mismatch Penalty=l, Joining Penalty=20, 
Randomization Group Length=0, Cutoff Score=l, Window Size=sequence length, Gap 
Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the subject 
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amino acid sequence, whichever is shorter. 

[0247J If the subject sequence is shorter than the query sequence due to N- or C- 
terminal deletions, not because of internal deletions, a manual correction must be made 
to the results. This is because the FASTDB program does not account for N- and C- 
terminal truncations of the subject sequence when calculating global percent identity. 
For subject sequences truncated at the N- and C-termini, relative to the query sequence, 
the percent identity is corrected by calculating the number of residues of the query 
sequence that are N- and C-terminal of the subject sequence, which are not 
matched/aligned with a corresponding subject residue, as a percent of the total bases of 
the query sequence. Whether a residue is matched/aligned is determined by results of 
the FASTDB sequence alignment. This percentage is then subtracted from the percent 
identity, calculated by the above FASTDB program using the specified parameters, to 
arrive at a final percent identity score. This final percent identity score is what is used 
for the purposes of the present invention. Only residues to the N- and C-termini of the 
subject sequence, which are not matched/aligned with the query sequence, are 
considered for the purposes of manually adjusting the percent identity score. That is, 
only query residue positions outside the farthest N- and C-terminal residues of the 
subject sequence. 

[0248] For example, a 90 amino acid residue subject sequence is aligned with a 100 
residue query sequence to determine percent identity. The deletion occurs at the N- 
terminus of the subject sequence and therefore, the FASTDB alignment does not show 
a matching/alignment of the first 10 residues at the N-terminus. The 10 unpaired 
residues represent 10% of the sequence (number of residues at the N- and C- termini 
not matched/total number of residues in the query sequence) so 10% is subtracted from 
the percent identity score calculated by the FASTDB program. If the remaining 90 
residues were perfectly matched the final percent identity would be 90%. In another 
example, a 90 residue subject sequence is compared with a 100 residue query sequence. 
This time the deletions are internal deletions so there are no residues at the N- or C- 
termini of the subject sequence which are not matched/aligned with the query. In this 
case the percent identity calculated by FASTDB is not manually corrected. Once again, 
only residue positions outside the N- and C-terminal ends of the subject sequence, as 
displayed in the FASTDB alignment, which are not matched/aligned with the query 
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sequence are manually corrected for. No other manual corrections are to made for the 
purposes of the present invention. 

[0249] In another embodiment of the invention, to determine the percent homology 
of two amino acid sequences, or of two nucleic acids, the sequences are aligned for 
optimal comparison purposes (e.g., gaps can be introduced in the sequence of one 
protein or nucleic acid for optimal alignment with the other protein or nucleic acid). 
The amino acid residues or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position in one sequence is occupied 
by the same amino acid residue or nucleotide as the corresponding position in the other 
sequence, then the molecules are homologous at that position. As used herein, amino 
acid or nucleic acid "homology" is equivalent to amino acid or nucleic acid "identity". 
The percent homology between the two sequences is a function of the number of 
identical positions shared by the sequences (i.e., per cent homology equals the number 
of identical positions/total number of positions times 100). 

[0250] Variants of above described sequences include a substantially homologous 
protein encoded by the same genetic locus in an organism, i.e., an allelic variant. 
Variants also encompass proteins derived from other genetic loci in an organism, but 
having substantial homology to the proteins of Figures 27-32, or otherwise described 
herein. Variants also include proteins substantially homologous to the protein but 
derived from another organism, i.e., an ortholog. Variants also include proteins that are 
substantially homologous to the proteins that are produced by chemical synthesis. 
Variants also include proteins that are substantially homologous to the proteins that are 
produced by recombinant methods. As used herein, two proteins (or a region of the 
proteins) are substantially homologous when the amino acid sequences are at least 
about 55-60%, typically at least about 70-75%, more typically at least about 80-85%, 
and most typically at least about 90-95% or more homologous. A substantially 
homologous amino acid sequence, according to the present invention, will be encoded 
by a nucleic acid sequence hybridizing to the nucleic acid sequence, or portion thereof, 
of the sequence shown in Figures 27, 28, 31 or otherwise described herein under 
stringent conditions as more fully described below. 

[0251] Orthologs, homologs, and allelic variants that are encompassed by the 
invention and that may be used according to the methods of the invention can be 
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identified using methods well known in the art. These variants comprise a nucleotide 
sequence encoding a protein that is at least about 55%, typically at least about 70-75%, 
more typically at least about 80-85%, and most typically at least about 90-95% or more 
homologous to the nucleotide sequence shown in Figures 27, 29, 31, or otherwise 
described herein, or a fragment of this sequence. Such nucleic acid molecules can 
readily be identified as being able to hybridize under stringent conditions, to the 
nucleotide sequence shown in Figures 27, 29, 31, or complementary sequence thereto, 
or otherwise described herein, or a fragment of the sequence. It is understood that 
stringent hybridization does not indicate substantial homology where it is due to general 
homology, such as poly A sequences, or sequences common to all or most proteins in 
an organism or class of proteins. 

[0252] The invention also encompasses polypeptides having a lower degree of 
identity but having sufficient similarity so as to perform one or more of the same 
functions performed by the enzyme polypeptides described herein. Similarity is 
determined by conserved amino acid substitution. Such substitutions are those that 
substitute a given amino acid in a polypeptide by another amino acid of like 
characteristics (see Table 1). Conservative substitutions are likely to be phenotypically 
silent. Typically seen as conservative substitutions are the replacements, one for 
another, among the aliphatic amino acids Ala, Val, Leu, and He; interchange of the 
hydroxyl residues Ser and Thr, exchange of the acidic residues Asp and Glu, 
substitution between the amide residues Asn and Gin, exchange of the basic residues 
Lys and Arg and replacements among the aromatic residues Phe, Tyr. Guidance 
concerning which amino acid changes are likely to be phenotypically silent are found in 
Bowie et ah, Science 247:1306-1310 (1990). 
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TABLE 1. Conservative Amino Acid Substitutions. 



Aromatic 


Phenylalanine 




Tryptophan 




Tyrosine 


Hydrophobic 


Leucine 




Isoleucine 




Valine 


Polar 


Glutamine 




Asparagine 


Basic 


Arginine 




Lysine 




Histidine 


Acidic 


Aspartic Acid 




Glutamic Acid 


Small 


Alanine 




Serine 




Threonine 




Methionine 




Glycine 



[0253] Both identity and similarity can be readily calculated {Computational 

Molecular Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; 
Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., Academic Press, 
New York, 1993; Computer Analysis of Sequence Data, Part 1, Griffin, A.M., and 
Griffin, H.G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular 
Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, 
Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991). Preferred 
computer program methods to determine identify and similarity between two sequences 
include, but are not limited to, GCG program package (Devereux, J. (1984) Nuc. Acids 
Res. 72(7,1:387), BLASTP, BLASTN, FASTA (Atschul, S.F. (1990) J. Molec. Biol. 
275:403). 
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[0254] A variant polypeptide can differ in amino acid sequence by one or more 

substitutions, deletions, insertions, inversions, fusions, and truncations or a 
combination of any of these. 

[0255] Variant polypeptides can be fully functional or can lack function in one 

or more activities. Thus, in the present case, variations can affect the function, for 
example, of one or more of the modules, domains, or functional subregions of the 
enzyme polypeptides of the invention. Preferably, polypeptide variants and fragments 
have the described activities routinely assayed via bioassays described herein or 
otherwise known in the art. 

[0256] Fully functional variants typically contain only conservative variation or 

variation in non-critical residues or in non-critical regions. Functional variants can also 
contain substitution of similar amino acids, which result in no change or an 
insignificant change in function. Alternatively, such substitutions may positively or 
negatively affect function to some degree. 

[0257] Non-functional variants typically contain one or more non-conservative 

amino acid substitutions, deletions, insertions, inversions, or truncation or a 
substitution, insertion, inversion, or deletion in a critical residue or critical region. As 
indicated, variants can be naturally-occurring or can be made by recombinant means or 
chemical synthesis to provide useful and novel characteristics for the polypeptide. 
[0258] Amino acids that are essential for function can be identified by methods 

known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis 
(Cunningham et al, Science 24*1081-1085 (1989)). The latter procedure introduces 
single alanine mutations at every residue in the molecule. The resulting mutant 
molecules are then tested for biological activity. Sites that are critical can also be 
determined by structural analysis such as crystallization, nuclear magnetic resonance or 
photoaffinity labeling (Smith et al.,J. Mol. Biol. 224:899-904 (1992); de Vos et al. 
Science 255:306-312 (1992)). 

[0259] The invention further encompasses variant polynucleotides, and 

fragments thereof, that differ from the nucleotide sequence, such as, for example, those 
shown in Figures 27, 29, 31 or otherwise described herein, due to degeneracy of the 
genetic code and thus encode the same protein as that encoded by the nucleotide 
sequence shown in Figures 27, 29, 31 or otherwise described herein. 
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[0260] The invention also provides nucleic acid molecules encoding the variant 

polypeptides described herein. Such polynucleotides may be naturally occurring, such 
as allelic variants (same locus), homologs (different locus), and orthologs (different 
organism), or may be constructed by recombinant DNA methods or by chemical 
synthesis. Such non-naturally occurring variants may be made by mutagenesis 
techniques, including those applied to polynucleotides, cells, or organisms. 
Accordingly, as discussed above, the variants can contain nucleotide substitutions, 
deletions, inversions and insertions. 

[0261] Variation can occur in either or both the coding and non-coding regions. 

The variations can produce both conservative and non-conservative amino acid 
substitutions. 

[0262] "Polynucleotides" or "nucleic acids" that may be used according to the 

methods of the invention also include those polynucleotides capable of hybridizing, 
under stringent hybridization conditions, to sequences contained in SEQ ED NO:l, the 
complement thereof, or a cDNA within the deposited plasmids. As used herein, the 
term "hybridizes under stringent conditions" is intended to describe conditions for 
hybridization and washing under which nucleotide sequences encoding a receptor at 
least 55% homologous to each other typically remain hybridized to each other. The 
conditions can be such that sequences at least about 65%, at least about 70%, or at least 
about 75% or more homologous to each other typically remain hybridized to each other. 
Such stringent conditions are known to those skilled in the art and can be found in 
Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. 
One example of stringent hybridization conditions are hybridization in 6X sodium 
chloride/sodium citrate (SSC) at about 45degrees C, followed by one or more washes in 
0.2 X SSC, 0.1% SDS at 50-65 degrees C. 

[0263] Also contemplated for use according to the methods of the invention are 

nucleic acid molecules that hybridize to a polynucleotide disclosed herein under lower 
stringency hybridization conditions. Changes in the stringency of hybridization and 
signal detection are primarily accomplished through the manipulation of formamide 
concentration (lower percentages of formamide result in lowered stringency); salt 
conditions, or temperature. For example, lower stringency conditions include an 
overnight incubation at 37 degree C in a solution comprising 6X SSPE (20X SSPE = 
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3M NaCl; 0.2M NaH 2 P0 4 ; 0.02M EDTA, pH 7.4), 0.5% SDS, 30% formamide, 100 
ug/ml salmon sperm blocking DNA; followed by washes at 50 degree C with 1XSSPE, 
0.1% SDS. In addition, to achieve even lower stringency, washes performed following 
stringent hybridization can be done at higher salt concentrations (e.g. 5X SSC). 
[0264] Note that variations in the above conditions may be accomplished 

through the inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include 
Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and 
commercially available proprietary formulations. The inclusion of specific blocking 
reagents may require modification of the hybridization conditions described above, due 
to problems with compatibility. 

[0265] Of course, a polynucleotide which hybridizes only to polyA+ sequences 

(such as any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 
complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid 
molecule containing a poly (A) stretch or the complement thereof (e.g., practically any 
double-stranded cDNA clone generated using oligo-dT as a primer). 
[0266] In one embodiment, an isolated nucleic acid molecule that hybridizes 

under stringent conditions to a sequence disclosed herein, or the complement thereof, 
such as, for example, the sequence of Figures 27, 29, 31, corresponds to a naturally- 
occurring nucleic acid molecule. As used herein, a "naturally-occurring" nucleic acid 
molecule refers to an RNA or DNA molecule having a nucleotide sequence that occurs 
in nature (e.g., encodes a natural protein). 

[0267] The present invention also encompasses recombinant vectors, which 

include the isolated nucleic acid molecules and polynucleotides that may be used 
according to the methods of the present invention, and to host cells containing the 
recombinant vectors arid/or nucleic acid molecules, as well as to methods of making 
such vectors and host cells and for using them for production of glycosylation enzyme 
by recombinant techniques. Polypeptides produced by such methods are also provided. 
[0268] The invention encompasses utilizing vectors for the maintenance 

(cloning vectors) or vectors for expression (expression vectors) of the desired 
polynucleotides encoding the carbohydrate processing of the invention, or those 
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encoding proteins to be sialylated by the methods of the invention and/or by expression 
of the proteins the cells of the invention. The vectors can function in prokaryotic or 
eukaryotic cells or in both (shuttle vectors). 

[0269] In one embodiment, one or more of the polynucleotide sequences used 

according to the methods of the invention are inserted into commercially, publicly, or 
otherwise available baculovirus expression vectors for enhanced expression of the 
corresponding enzyme. In another non-exclusive embodiment, one ore more of the 
polynucleotides used according to the methods of the invention are inserted into other 
viral vectors or for generation of stable insect cell lines. Techniques known in the art, 
such as, for example, HPAEC and HPLC techniques, may be routinely used to evaluate 
the enzymatic activity of these enzymes from both eukaryotic and bacterial sources to 
determine which source is best for generating SA in insect cells. 

[0270] Generally, expression vectors contain cis-acting regulatory regions that 

are operably linked in the vector to the polynucleotide to be expressed, or other relevant 
polynucleotides such that transcription of the polynucleotides is allowed in a host cell. 
The polynucleotides can be introduced into the host cell with a separate polynucleotide 
capable of affecting transcription. Thus, the second polynucleotide may provide a 
trans-acting factor interacting with the cis-regulatory control region to allow 
transcription of the polynucleotides from the vector. Alternatively, a trans-acting factor 
may be supplied by the host cell. Finally, a trans-acting factor can be produced from 
the vector itself. 

[0271] It is understood, however, that in some embodiments, transcription of 

the polynucleotides can occur in a cell-free system. 

[0272] The regulatory sequence to which the polynucleotides described herein 

can be operably linked include, for example, promoters for directing mRNA 
transcription. These promoters include, but are not limited to, baculovirus promoters 
including, but not limited to, 1E0, 1E1, 1E2, 39k, 35k, egt, ME53, ORF 142, PE38, 
p6.9, capsid, gp64 polyhedrin, plO, basic and core; and insect cell promoters including, 
but not limited to, Drosophila actin, metallothionine, and the like. Where the host cell is 
not an insect cell, such promoters include, but are not limited to, the left promoter from 
bacteriophage lambda, the lac, TRP, and TAC promoters from E. coli, promoters from 
Actinomycetes, including Nocardia, and Streptomyces. 
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[0273] Promoters may be isolated, if they have not already been isolated, by 

standard promoter identification and trapping methods known in the art, see, for 
example, in Sambrook et al. 9 Molecular Cloning: A Laboratory Manual. 2nd. ed., Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY, (1989). 
[0274] It would be understood by a person of ordinary skill in the art that the 

choice of promoter would depend upon the choice of host cell. Similarly, the choice of 
host cell will depend upon the use of the host cell. Accordingly, host cells can be used 
for simply amplifying, but not expressing, the nucleic acid. However, host cells can 
also be used to produce desirable amounts of the desired polypeptide. In this 
embodiment, the host cell is simply used to express the protein per se. For example, 
amounts of the protein could be produced that enable its purification and subsequent 
use, for example, in a cell free system. In this case, the promoter is compatible with the 
host cell. Host cells can be chosen from virtually any of the known host cells that are 
manipulated by the methods of the invention to produce the desired glycosylation 
patterns. These could include mammalian, bacterial, yeast, filamentous fungi, or plant 
cells. 

[0275] In addition to control regions that promote transcription, expression 

vectors may also include regions that modulate transcription, such as repressor binding 
sites and enhancers. 

[0276] In addition to containing sites for transcription initiation and control, 

expression vectors can also contain sequences necessary for transcription termination 
and, in the transcribed region a ribosome binding site for translation. Other regulatory 
control elements for expression include initiation and termination codons as well as 
polyadenylation signals. The person of ordinary skill in the art would be aware of the 
numerous regulatory sequences that are useful in expression vectors. Such regulatory 
sequences are described, for example, in Sambrook et al. 9 cited above. 
[0277] Depending on the choice of a host cell, a variety of expression vectors 

can be used to express the polynucleotide. Such vectors include chromosomal, 
episomal, and particularly virus-derived vectors, for example, AcMNPV, OpMNPV, 
BmNPV, HzMNPV, and RoMNPV. Vectors may also be derived from combinations 
of these sources such as those derived from plasmid and bacteriophage genetic 
elements, e.g. cosmids and phagemids. Appropriate cloning and expression vectors for 
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prokaryotic and eukaryotic hosts are described in Sambrook et al., Molecular Cloning: 
A Laboratory Manual. 2nd. ed. , Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, (1989). 

[0278] The regulatory sequence may provide constitutive expression in one or 

more host cells or may provide for inducible expression in one or more cell types such 
as by temperature, nutrient additive, or exogenous factor such as a hormone or other 
ligand. A variety of vectors providing for constitutive and inducible expression in 
prokaryotic and eukaryotic hosts are well known to those of ordinary skill in the art. 
[0279] The polynucleotides can be inserted into the vector nucleic acid using 

techniques known in the art. Generally, the DNA sequence that will ultimately be 
expressed is joined to an expression vector by cleaving the DNA sequence and the 
expression vector with one or more restriction enzymes and then ligating the fragments 
together. Procedures for restriction enzyme digestion and ligation are well known to 
those of ordinary skill in the art. 

[0280] Specific expression vectors are described herein for the purposes of the 

invention; for example, AcMNPV. Other expression vectors listed herein are not 
intended to be limiting, and are merely provided by way of example. The person of 
ordinary skill in the art would be aware of other vectors suitable for maintenance, 
propagation, or expression of the polynucleotides described herein. These are found for 
example in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A 
Laboratory Manual. 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989. Any cell type or expression system 
can be used for the purposes of the invention including but not limited to, for example, 
baculovirus systems (O'Riley et al. (1992) Baculovirus Expression Vectors, W.H. 
Freeman and Company, New York 1992) and Drosophila-denved systems (Johansen et 
al. (1989) Genes Dev 5(tfJ:882-889). 

[0281] The invention also encompasses vectors in which the nucleic acid 

sequences described herein are cloned into the vector in reverse orientation, but 
operably linked to a regulatory sequence that permits transcription of antisense RNA. 
Thus, an antisense transcript can be produced to all, or to a portion, of the 
polynucleotide sequences described herein, including both coding and non-coding 
regions. Expression of this antisense RNA is subject to each of the parameters 
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described above in relation to expression of the sense RNA (regulatory sequences, 
constitutive or inducible expression, tissue-specific expression). 
[0282] The recombinant host cells are prepared by introducing the vector 

constructs described herein into the cells by techniques readily available to the person 
of ordinary skill in the art. These include, but are not limited to, calcium phosphate 
transfection, DEAE-dextran-mediated transfection, cationic lipid-mediated transfection, 
electroporation, transduction, infection, lipofection, and other techniques such as those 
found in Sambrook, et al. {Molecular Cloning: A Laboratory Manual. 2nd, ed., Cold 
Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY, 1989). 

[0283] Where secretion of the polypeptide is desired, appropriate secretion 

signals known in the art are incorporated into the vector using techniques known in the 
art. The signal sequence can be endogenous to the polypeptides or heterologous to 
these polypeptides. 

[0284] Where the polypeptide is not secreted into the medium, the desired 

protein can be isolated from the host cell by techniques known in the art, such as, for 
example, standard disruption procedures, including freeze thaw, sonication, mechanical 
disruption, use of lysing agents and the like. The polypeptide can then be recovered 
and purified by well-known purification methods including, but not limited to, 
ammonium sulfate precipitation, acid extraction, anion or cationic exchange 
chromatography, phosphocellulose chromatography, hydrophobic-interaction 
chromatography, affinity chromatography, hydroxylapatite chromatography, lectin 
chromatography, and high performance liquid chromatography. 
[0285] Furthermore, for suppressing activity of endogenous N- 

acetylglucosaminidase, the invention encompasses utilizing the sequences deduced 
from the fragment identified in Figure 18, and described in Example 4. More 
particularly, in this aspect, the invention comprises utilization of the glucosaminidase 
nucleotide sequences which are produced by using primers, such as, for example, those 
primer combinations described in Example 4. These nucleotide sequences may be used 
in the construction and expression of anti-sense RNA, ribozymes, or homologous 
recombination (gene "knock-out") constructs, using methods readily available to those 
skilled in the art, to reduce or eliminate in vivo glucosaminidase activity. 



• 
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[0286] Cell lines produced by the methods of the invention can be tested by 

expressing a model recombinant glycoprotein in such cell lines and assessing the N- 
glycans attached therein using techniques described herein or otherwise known in the 
art. The assessment can be done, for example, by 3-dimensional HPLC techniques. In 
the Examples of the invention, human transferrin is used as a model target glycoprotein, 
since this glycoprotein is sialylated in humans and extensive oligosaccharide structural 
information for the protein is available (Montreuil et al. (1997) Glycoproteins //Ed. 
203-242). In this manner, cell lines with superior processing characteristics are 
identified. Such a cell line can then be evaluated for its growth rate, product yields, and 
capacity to grow in suspension culture (Lindsay et al. (1992) Biotech, and Bioeng. 
39:614-61 8, Reuveny a/. (1992) Ann. NY Acad. Sci. 665:320, Reuveny et al. (1993) 
Appl. Microbiol. Biotechnol. 38:619-623, Reuveny et al. (1993) Biotechnol Bioeng. 
42:235-239). 

[0287] The invention encompasses expressing heterologous proteins in the cells 

of the invention and/or according to the methods of the invention for any purpose 
benefiting from such expression. Such a purpose includes, but is not limited to, 
increasing the in vivo circulatory half life of a protein; producing a desired quantity of 
the protein; increasing the biological function of the. protein including, but not limited 
to, enzyme activity, receptor activity, binding capacity, antigenicity, therapeutic 
property, capacity as a vaccine or a diagnostic tool, and the like. Such proteins may be 
naturally occurring chemically synthesized or recombinant proteins. Examples of 
proteins that benefit from the heterologous expression of the invention include, but are 
not limited to, transferrin, plasminogen, Na + , K + -ATPase , thyrotropin, tissue 
plasminogen activator, erythropoietin, interleukins, and interferons. Other examples of 
such proteins include, but are not limited to, those described in International patent 
application publication number WO 98/06835, the contents of which are herein 
incorporated by reference. 

[0288] In one embodiment, proteins that benefit from the heterologous 

expression of the invention are mammalian proteins. In this aspect, mammals include 
but are not limited to, cats, dogs, rats, mice, cows, pigs, non-human primates, and 
humans. 
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[0289] It is recognized that the heterologous expression of the invention not 

only encompasses proteins that are sialylated in their native source; but also those that 
are not sialylated as such, and benefit from the expression in the cells of and/or 
according to the methods of the invention. 

[0290] It is recognized that proteins that are not sialylated in their native source, 

can be altered by known genetic engineering methods so that the heterologous 
expression of the protein according to the invention will result in sialylation of the 
protein. Such methods include, but are not limited to, the genetic engineering methods 
described herein. In this aspect, it is further recognized that altering the proteins could 
encompass engineering into the protein targeting signals to ensure targeting of the 
proteins to the ER and Golgi apparatus for sialylation, where such signals are needed. 
[0291] It is also recognized that the cells of the invention contain proteins, 

which are not sialylated prior to manipulation of the cells according to the methods of 
the invention, but are sialylated subsequent to the manipulation. In this manner, the 
invention also encompasses proteins that have amino acid sequences that are 
endogenous to the cells of the invention, but are sialylated as a result manipulation of 
the cells according to the methods of the invention. 

[0292] It is recognized that the analysis of the N-glycans produced according to 

the methods of the invention may suggest additional strategies to further enhance the 
sialylation of glycoproteins in insect cells. If the production of Gal containing 
carbohydrate acceptor structures is low relative to those containing GlcNAc, then the 
levels of Gal transferase expression are increased by integrating multiple copies of this 
gene into the insect cell genome or by expressing Gal T under a stronger promoter 
using techniques described herein or otherwise known in the art. Additionally, or 
alternatively, substrate feeding strategies are used to enhance the levels of UDP-Gal for 
this carbohydrate processing reaction. In contrast, if the fraction of carbohydrate 
structures terminating in Gal is high and the fraction with terminal S A is low, then 
sialyltransferase or CMP-S A production is enhanced. Examination of sialyltransferase 
activity using techniques described herein or otherwise known in the art, such as, for 
example, FRET or HPLC and CMP-SA levels using HPAEC, is used to determine 
which step is the metabolic limiting step to sialylation. These metabolic limitations are 
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overcome by increasing expression of specific enzymes or by altering substrate feeding 
strategies or a combination thereof. 

ASSAYS 

[0293] Having generally described the invention, the same will be more readily 

understood by reference to the following assays and examples, which are provided by 
way of illustration and are not intended as limiting. 

[0294] Analytical bioassays are implemented to evaluate enzymatic activities in 

the N-glycosylation pathway of insect cells. In order to screen a larger selection of 
insect cells for particular oligosaccharide processing enzymes, bioassays in which 
multiple samples can be analyzed simultaneously are advantageous. Consequently, 
bioassays based on fluorescence energy transfer (FRET) and time-resolved fluorometry 
of europium (Eu) are designed to screen native and recombinant insect cell lines for 
carbohydrate processing enzymes in a format that can handle multiple samples. 
[0295] Fluorescence assays are especially useful in detecting limiting steps in 

carbohydrate processing due to their sensitivity and specificity. FRET and Eu assays 
detect enzymatic activities at levels as low as 10" 14 M, which is greater than the 
sensitivity obtained with 125 L In addition, the use of substrates modified with 
fluorophores enables the measurement of one specific enzyme activity in an insect cell 
lysate, and multiple samples can be analyzed simultaneously in a microtiter plate 
configuration used in an appropriate fluorometer. With these assays, insect cell lines are 
rapidly screened for the presence of processing enzymes including Gal, GlcNAc, and 
sialic acid transferases to identify limiting enzymes in N-glycosylation in native and 
recombinant cells. 

Fluorescence energy transfer (FRET) assays 
[0296] Glycosyl transferase activity assays are based on the principle of 

fluorescence energy transfer (FRET), which has been used to study glycopeptide 
conformation (Rice et al (1991) Biochemistry 30:6646-6655) and to develop endo-type 
glycosidase assays (Lee et al. (1995) Anal Biochem. 230:31-36). 
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Gal T assay 

[0297] The fluorescent compound, UDP-Gal-6-Naph, synthesized by 

consecutive reactions of galactose oxidase (generating 6-oxo compound) and reductive 
amination with naphthylamine, is found to be effective as a substrate for Gal 
transferase. When UDP-Gal-6-Naph is reacted with an acceptor carrying a dansyl 
group (Dans-AE-GlcNAc) in the presence of Gal-T, a product is created that can 
transfer energy (Figure 12). While irradiation of the naphthyl group in UDP-Gal-6- 
Naph at 260-290 nm ("ex" in Figure 13) results in the usual emission at 320-370 nm 
("em" dotted line in Figure 13), irradiation of the product at these same low 
wavelengths results in energy transfer to the dansyl group and emission at 500-560 nm 
("em" solid line in Figure 13). Assay sensitivity is as great as the fluorometer allows 
(pico- to femtomol range) and exceeds that of radioisotopes. In addition, multiple 
samples can be monitored simultaneously in the fluorometer, allowing a number of cell 
lines to be evaluated rapidly for Gal T activity. 

Sialyltransferase assay 
[0298] A sialyltransferase assay is designed using similar FRET technology 

described in the above example for Gal T. The 3-carbon tail (exocyclic chain) of sialic 
acid (in particular, its glycoside) can be readily oxidized with mild periodate to yield an 
aldehyde (Figure 14). This intermediate is reductively aminated to generate a 
fluorescently tagged sialic acid (after removal of its aglycon), which is then modified to 
form a fluorescently modified CMP-sialic acid (See also Lee et al. (1994) Anal. 
Biochem. 216:358-364, Brossamer et al. (1994) Methods Enzymol. 247:153-177). The 
acceptor substrate is modified as described above to include the dansyl group. Then the 
FRET approach is used to measure either alpha(2, 3) or alpha(2, 6) sialyltransferase 
activity since these enzymes should utilize the modified CMP-SA as donor substrate to 
generate a product with altered fluorescent emission characteristics. 
[0299] The choice of the fluorescent donor and acceptor pair can be flexible. 

The above examples are given using naphfhyl-dansyl pairs, but other fluorescent 
combinations may be even more sensitive (Wu et al. (1994) Anal. Biochem. 250:260- 
262). 
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Europium (Eu +) ) fluorescence assays. 
[0300] An example of the use of Eu +3 fluorescence for the evaluation of Gal T 

activity is provided herein in the N-linked oligosaccharides from insect cells. The same 
techniques are used to develop enzymatic assay for transferases such as GlcNAc Tl and 
glycosidases such as N-acetylglucosaminidase. Further enhancements in sensitivity are 
obtained with the advent of the super-sensitive Eu-chelator, BHHT (4, 4'- bis (1", 1", I", 
2", 2", 3", 3'-heptatluro-4", 6"-hexanedione-6'-yl)-chlorosulfo-o~terphenyl) (Yuan et 
al. (1998) Anal. Chem. 70:596-601), which allows detection down to the lower finol 



range. 



GlcNac-TI Assay 

[0301] A new GlcNAc-TI assay, illustrated in Figure 1 5, utilizes a synthetic 6- 

aminohexyl glycoside of the trimannosyl N-glycan core structure labeled with DTP A 
(Diethylenetriaminepentaacetic acid) and complexed with Eu +3 . This substrate is then 
incubated with insect cell lysates or positive controls containing GlcNAc Tl and UDP- 
GlcNAc. Addition of chemical inhibitors are used to minimize background N- 
acetylglucosaminidase activity. After the reaction, an excess of Crocus lectin CVL 
(Misaki et al. (1997) J. Biol. Chem. 272:25455-25461), which specifically binds the 
trimannosyl core, is added. The amount of the lectin required to bind all the trimannosyl 
glycoside (and hence all the Eu +3 label) in the absence of any GlcNAc binding is 
predetermined. The reacted mixture is then filtered through a 10,000 molecular weight 
cut off (MWCO) microfuge ultrafiltration cup. The glycoside modified with GlcNAc 
does not bind CVL and appears in the filtrate. Measurement of the Eu +3 fluorescence in 
the filtrate reflects the level of GlcNAc Tl activity in the culture lysates. 

N-acetylglucosaminidase assay 
[0302] An assay for N-acetylglucosaminidase activity is developed using a 

different lectin, GS-H, which is specific for GlcNAc. The substrate is prepared by 
modification of the same trimannosyl core glycoside described above using in vitro 
purified GlcNAc Tl, which results in addition of a GlcNAcJ>eta(l-2) residue to the 
Man_alpha(\-3) residue. Following incubation with insect cell lysates, enzymatic 
hydrolysis by N-acetylglucosaminidase removes GlcNAc from the substrate resulting in 
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the tri-mannosyl core product. The product is not susceptible to lectin binding and thus 
escapes into the filtrate. Evaluation of Eu +3 fluorescence in the filtrate provides a 
measure of the N-acetylglucosaminidase activity. Alternatively, enhanced binding of 
the Eu-bound trimannosyl core to the Crocus lectin described above can be used as 
another assay for N-acetylglucosaminidase activity. 

Characterization ofN-linked Oligosaccharides from Insect Cells 
[0303] Carbohydrate structure elucidation of the N-glycans of a recombinant 

glycoprotein, IgG, purified from Trichoplusia ni (High Five™ cells; Invitrogen Corp., 
Carlsbad, CA, USA) has been undertaken (Davis et al. (1993) In Vitro Cell. Dev. Biol. 
29:842-846; Hsu et al. (1997) J. Biol. Chem. 272:9062-9070). The recombinant 
glycoprotein, immunoglobulin G (IgG), was purified from both intracellular and 
extracellular (secreted) sources and all the attached N-glycans determined using three 
dimensional HPLC techniques. The composition of these structures provided insights 
into the carbohydrate processing pathways present in insect cells and allowed a 
comparison of intracellular and secreted N-glycan structures. 

[0304] The Trichoplusia ni cells grown in serum free medium in suspension 

culture were infected with a baculovirus vector encoding a murine IgG (Summers et al. 
(1987) A manual of methods for baculovirus vectors and insect cells culture 
procedures). IgG includes an N-linked oligosaccharide attachment on each of the two 
heavy chains. 

[0305] Heterologous IgG was purified from the culture supernatant and soluble 

cell lysates using a Protein A-Sepharose column. N-linked oligosaccharides were 
isolated following protease digestion of IgG and treatment with glycoamidase A to 
release the N-glycans. Oligosaccharides were then derivatized with 2-aminopyridine 
(PA) at the reducing ends to provide fluorogenic properties for detection. 
[0306] Three-dimensional HPLC analysis, was performed to elucidate the N- 

linked oligosaccharide structures attached to the heavy chain of IgG (Tomiya et al. 
(\9%%)Anal. Biochem. 171:73-90, Takahashi et al. (1992) Handbook of 
Endoglycosidases and Glycoamidases Ed. 199-332). This technique separates 
oligosaccharides by three successive HPLC steps and enables the identification of 
structures by comparison of elution conditions with those of known standards. 
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[0307] A DEAE column was used to separate oligosaccharides on the basis of 

carbohydrate acidity (first dimension). None of the oligosaccharides retained on this 
column were found to include sialic acid. Treatment of the acidic fractions with 
neuraminidase from Arthrobacter ureafaciens (known to cleave all known sialic acid 
linkages) failed to release any sialic acid, and ODS-chromatography of the fractions 
revealed several minor components different from all known sialylated 
oligosaccharides. 

[0308] The second dimension used reverse phase HPLC with an ODS-silica 

column to fractionate the labeled oligosaccharides according to carbohydrate structure. 
Supernatant (S) and lysate (L) IgGs oligosaccharides were separated into 6 and 10 
fractions, respectively, labeled A-L in Figure 6. 

[0309] Separation in the third and final dimension was accomplished using an 

amide column to isolate oligosaccharides on the basis of molecular size. Peak B from 
the ODS column was separated into two separate oligosaccharide fractions, and peak H 
was separated into three separate oligosaccharide fractions on the amide-column. 
[0310] After oligosaccharide purification, structures of unknown 

oligosaccharides were determined by comparing their positions on the 3-dimensional 
map with the positions of over 450 known oligosaccharides. Co-elution of an unknown 
sample with a known PA-oligosaccharide on the ODS and amide-silica columns was 
used to confirm the identity of an oligosaccharide. Digestion by glycosidases with 
specific cleavage sites (a/p//a-L-fucosidase, 6eta-galactosidase, beta-N- 
acetylglucosaminidase, and a/p/w-mannosidase) followed by reseparation provided 
further confirmation. 

[031 1] AH the oligosaccharides in the culture medium and cell lysates matched 

known carbohydrates except for oligosaccharide G. The structure of oligosaccharide G 
was elucidated by treatment of the N-glycan with a/p/^-L-fucosidase, known to digest 
Fuc_a/pAal-6GIcNAc, followed by treatment with 13.5 M trifluoroacetic acid to 
remove the alpha\, 3 linked fucose. The de-alpha\, 6- and de-alphal, 3-fucosylated 
oligosaccharide G co-eluted with a known oligosaccharide, allowing the identification 
of G. The structure of oligosaccharide G is shown in Figure 7. 
[03 1 2] The structure of oligosaccharide G was further confirmed by 1 H-NMR 

and electrospray ionization (ESI) mass spectrometry (Hsu et al. (1997) J. Biol. Chem. 
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272:9062-9070). Thus, the combination of these techniques can be used to elucidate 
both known and unknown oligosaccharides. 

[0313] The carbohydrates attached to IgG from the culture medium and 

intracellular lysate were identified and the levels present in each source were quantified. 
These structures were then used in conjunction with previous studies of oligosaccharide 
processing in insect cells (Altmann et al. (1996) Trends in Glycoscience and 
Glycotechnology 8:101-1 14) to generate a detailed map of N-linked oligosaccharide 
processing in Trichoplitsia ni insect cells. The pathway and the levels of the 
oligosaccharides from secreted and intracellular sources are detailed in Figure 8. 
[0314] The initial processing in the T. ni cells appears to be similar to the 

mammalian pathway, including trimming of the terminal glucose and mannose 
residues. The trimming process follows a linear pathway with the exception of two 
different forms of the Man 7 GlcNAc 2 (M7GN, in Figure 8 also observed in native insect 
glycoproteins (Altmann et al. (1996) Trends in Glycoscience and Glycotechnology 
8:101-1 14) and IgG 4 , from NS/0 cells (Ip et al. (1994) Arch. Biochem. Biophys. 
308:387-399). The presence of these two Man 7 forms suggests the possible existence of 
an alternative processing pathway that yields Man 7 GlcNAc 2 through the action of endo- 
<z/p/ja-mannosidase. Following cleavage of the mannose residues, GlcNAc (GN) is 
added to the alpha 1 ,3 branch of Man 5 GlcNAC 2 by GlcNAc TI (N- 
acetylglusosaminyltransferase I) (Altmann et al. (1996) Trends in Glycoscience and 
Glycotechnology 8:101-1 14). However, GlcNAc, Man 5 GlcNAC 2 must be a short-lived 
intermediate quickly processed by alpha-Man II, since this structure was not detected in 
the T. ni cell lysate. At the GlcNAc u Man 3 GlcNAc 2 oligosaccharide, several branching 
steps in the N-glycan processing pathway are possible in insect cells. Complex 
glycoforms can be generated by the action of GlcNAc Til (N- 
acetylglucosaminyltransferase IT) and Gal T (galactosyltransferase T) to provide 
oligosaccharides which include terminal GlcNAc (GN) and Gal (G) residues. None of 
the complex oligosaccharide structures included sialic acid indicating that sialylation is 
negligible or non-existent in these cells. 

[0315] The production of these complex glycoforms must compete with an 

alternative processing pathway that is catalyzed by N-acetylglucosaminidase (Altmann 
et al. (1995) J. Biol. Chem. 270:17344-17349) (see Branch Points in Figure 8), leading 
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to the production of hybrid and paucimannosidic structures. While the complex-type N- 
glycans represent 35% of the total secreted glycoforms (supernatant % in Figure 8), the 
majority of secreted N-glycans are either paucimannosidic (35%) or hybrid structures 
(30%). Furthermore, those complex structures with a branch terminating in Gal 
represent less than 20% of the total secreted glycoforms and no structures were 
observed with terminal Gal on both branches of the N-glycan. 

[0316] In contrast to the secreted glycoforms, the intracellular N-glycans (lysate 

% in Figure 8) obtained from insect cells include more than 50% high-mannose type 
structures. The fraction of intracellular complex oligosaccharides is less than 15% and 
only 8% include a terminal Gal residue. The high level of high-mannose structures from 
intracellular sources indicates significantly less oligosaccharide processing for most of 
the intracellular immunoglobulins. Many of these intracellular immunoglobulins may 
not reach the compartments in which carbohydrate trimming takes place (Jarvis et al 
(1989) Mol Cell Biol 9:214-223). High mannose glycoforms are also observed 
intracellular^ for mammalian cells (Jenkins et al (1998) Cell Culture Engineering VI). 



Examples 



Example 1 : Evaluation of N-glycosvlation Pathway Enzymes 
[0317] The levels of N-linked oligosaccharide processing enzymes are 

measured using analytical assays to characterize carbohydrate processing in native and 
recombinant insect cells. These assays are used to compare the N-glycan processing 
capacity of different cell lines and to evaluate changes in processing and metabolite 
levels following metabolic engineering modifications. 

[0318] High Performance Anion Exchange Chromatography (HPAEC) assay 

for galactose transferase 

[0319] HPAEC is used in combination with pulsed amperometric detection 

(HPAEC-PAD) or conductivity to detect metabolite levels in the CMP-SA pathway and 
to evaluate N-linked oligosaccharide processing enzymes essentially as described by 
(Lee et al (1990) Anal Biochem. 34:953-957, Lee et al (1996) J. Chromatography A 
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720:137-149). Shown in Figure 9 is an example of the use of HP AEC-P AD for 
measuring Gal T activity by following the lactose formation reaction: 

UDP-Gal + Glc GalT Lac + UDP 

[0320] The peak labeled "Lac" indicates the formation of the product lactose 

(Lac). Many of the enzymes involved in N-glycosylation (e.g., aldolase, CMP-NeuAc 
synthetase, sialyltransferase) and metabolic intermediates (e.g., sialic acid, CMP-sialic 
acid, ManNAc, ManNAc-6-phosphate) in the CMP-SA production pathway are 
measured using this form of chromatography, essentially as described by Lee et ah 
(1990) Anah Biochem. 34:953-957, Leeefa/. (1996) J. Chromatography A 720:137- 
149, Hardy et ah (1988) Anah Biochem. 170:54-62, Townsend et ah (1988) Anah 
Biochem. 174:459-470, Kiang et ah (1997) Anah Biochem. 245:97-101. 

Reverse phase High Performance Liquid Chromatography (HPLC)for 
sialyltransferase 

[0321] To detect native sialyltransferase enzyme activity, Trichoplusia ni 

lysates were incubated in the presence of exogenously added CMP-SA and the 
fluorescent substrate, 4-methylumbelliferyl lactoside (Lac-MU). Negligible conversion 
of the substrate was observed, indicating the absence of endogenous sialyltransferase 
activity. However, following infection of the insect cells with a baculo virus encoding 
human alpha2-3 -sialyltransferase, conversion of Lac-MU to the product sialyl LacMU 
was observed in cell lysates using Reverse Phase HPLC and a fluorescence detector 
(Figure 10). For higher sensitivity, Lac-PA (PA=2-aminopyridine) or Lac-ABA 
(ABA=o-aminobenzamide) are used as substrates. HPLC and HPAEC is used in 
conjunction with other fluorometric methods detailed in the procedures to analyze the 
metabolites and enzymatic activities in insect cells. 

Dissociation Enhanced Lanthananide FluorommunoAssay (DELFIA) for GalT 
[0322] The previous chromatography techniques have one limitation in that 

only one sample can be handled at a time. When samples from several cell lines must 
be assayed, a method such as DELFIA is advantageous since a multiwell fluorometer 
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can simultaneously examine activities in many samples on a microtiter plate (Hemmila 
et al. (1984) Anal. Biochem. 137:335-343). The application of such a technique for the 
measurement of Gal T activity in several different insect cell lysates and controls is 
shown in Figure 11. First, the wells of the microtiter plate are coated with the substrate 
GlcNAc-BSA (Stowell et al. (1993) Meth. in Carb. Chem. 9:178-181). After 
incubation with Gal T and UDP-Gal, the well is washed and the Gal residue newly 
attached to GlcNAc-BSA is measured with europium (Eu +3 )-labeled Ricinus cummunis 
lectin, which specifically binds Gal or GalNAc structures. The sensitivity of Eu 
fluorescence under appropriate conditions can reach the fmol range and match or 
eclipse that of radioiodides (Kawasaki et al. (1997) Anal. Biochem. 250:260-262). 
[0323] Figure 1 1 depicts GlcNAc-BSA in (A) Boiled lysate; (B) T. ni; (C) 

Standard enzyme, 0.5 mU; (D) T. ni insect cells infected with a baculovirus coding for 
GalT (E) Sf-9 cells stably transfected with GalT gene. The increase in Gal T activity in 
untreated cell lysates (B in Figure 1 1) relative to boiled lysates (A) indicates that T. ni 
cells have low but measurable endogenous Gal T activity. The Gal T activity level is 
increased significantly following infection with a baculovirus vector including a 
mammalian Gal T gene under the IE1 promoter or by using Sf-9 cells stably- 
transformed with the Gal T gene (cell lines are described in Jarvis et al. (1996) Nature 
Biotech. 14:1288-1292; and Hollister et al. (1998) Glycobiology 5:473-480). 
[0324] The DELFIA method is not limited to Gal T measurement. This 

technique is used to evaluate the activity of any processing enzyme which generates 
carbohydrate structures containing binding sites for a specific lectin or carbohydrate- 
specific antibodies (Taki et al. (1994) Anal. Biochem. 219:104-108, Rabina et al. 
(1997) Anal. Biochem. 246:459-470). 

Exam ple 2: Enhancing SA levels by Substrate Addition 
[0325] Because the conventional substrates in insect cell media are not 

efficiently converted to CMP-SA in insect cells as demonstrated by the low levels of 
CMP-SA, alternative substrates are added to the culture medium. Because sialic acid 
and CMP-SA are not permeable to cell membranes (Bennetts et al. (1981) J. Cell. Biol. 
88:1-15), they are not considered as appropriate substrates. However, other precursors 
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in the CMP-SA pathway are incorporated into cells and considered as substrates for the 
generation of CMP-SA in insect cells. 

Incorporation and conversion of N-acetylmannosamine (ManNAc) 
[0326] ManNAc has been added to mammalian tissue and cell cultures and 

enzymatically converted to SA and CMP-SA (Ferwerda et al. (1983) Biochem. J. 
216:87-92, Gu et al. (1997) Improvement of the interferon-gamma sialylation in 
Chinese hamster ovary cell culture by feeding N-acetylmannosamine, Thomas et al. 
(1985) Biochim. Biophys. Acta 846:37-43). Consequently, external feeding of 
ManNAc is examined as one strategy to enhance CMP-SA levels in insect cells. 
ManNAc is available commercially (Sigma Chemical Co.) or can be prepared 
chemically from the less expensive feedstock GlcNAc in vitro using sodium hydroxide 
(Mahmoudian et al. (1997) Enzyme and Microbial Technology 20:393-400). Initially, 
the levels of native cellular ManNAc, if any, is determined using HPAEC-PAD 
techniques (Lee et al. (1990) Anal. Biochem. 34:953-957, Lee et al. (1996) J. 
Chromatography A 720:137-149, Hardy et al. (1988) Anal. Biochem. 170:54-62, 
Townsend et al. (1988) Anal. Biochem. 174:459-470, Kiang et al. (1997) Anal. 
Biochem. 245:97-101). The ability to increase intracellular ManNAc levels is 
evaluated by adding ManNAc to cell culture media. Incorporation of exogenous 
ManNAc is quantified using unlabeled ManNAc if levels of native ManNAc are 
negligible, or 14 C- or 3 H-labeled ManNAc if significant levels of native ManNAc are 
present) (Bennetts et al. (1981) J. Cell. Biol. 88:1-15, Kriesel et al. (1988) J. Biol. 
Chem. 263:1 1736-1 1742). The levels of radioactive ManNAc and other metabolites are 
determined by collecting ManNAc peaks following HPAEC and measuring the 
radioactivity using scintillation countering. 

[0327] To be effective as a substrate for sialylation, the ManNAc must be 

converted to SA and CMP-SA through intracellular pathways. This conversion is 
detected directly from externally added ManNAc by following an increase in internal 
SA and CMP-SA levels using HPAEC or thin layer chromatography (TLC) combined 
with liquid scintillation counting to detect the radiolabeled metabolites. HPAEC 
techniques have been used to quantify cellular pools of CMP-SA in as few as 6 x 10 6 
mammalian cells (Fritsch et al. (1996) Journal of Chromatography A 727 ':223-230), 



68 



and TLC has been used to evaluate conversion of 14 C labeled ManNAc to sialic acid in 
bacteria (Vann et al. (1997) Glycobiology 7:697-701). If the addition of ManNAc leads 
to a significant increase in the CMP-SA levels, a limiting step exists in the production 
of ManNAc from conventional insect cell media substrates. Different ManNAc feeding 
concentrations are tested and the effect on CMP-SA levels and insect cell viability 
evaluated to determine if there are any deleterious effects from feeding the ManNAc as 
substrate. Conversion of ManNAc to SA through the aldolase pathway requires 
pyruvate, and the addition of cytidine can enhance CMP-SA production from SA 
(Thomas et al. (1985) Biochim. Biophys. Acta 846:37-43). Thus, pyruvate and cytidine 
are optionally added to the medium to enhance conversion of ManNAc to CMP-SA 
(Tomita et al. (1995) Biochim. Biophys. Acta 1243:329-335, Thomas et al. (1985) 
Biochim. Biophys. Acta 846:37-43). 



Alternative Substrates 
[0328] Other precursors substrates such as N-acetylglucosamine (GlcNAc) and 

glucosamine are converted to S A and CMP-SA through the ManNAc pathway in 
eukaryotic cells (Pederson et al. (1992) Cancer Res. 52:3782-3786, Kohn et al. (1962) 
J. Biol. Chem. 237:304-308). The disposition of these alternative precursor substrates 
are monitored using HPAEC analysis using techniques known in the art and compared 
with ManNAc feeding strategies to determine which substrate provides for the most 
efficient production of CMP-SA, in particular insect cells. 

Exam ple 3: Purification and cloning of CMP-SA synthetase 
[0329] A bioinformatics search of the cDNA libraries of HGS revealed a novel 

human CMP-sialic acid synthetase (CMP-SA synthetase, or CMP-SAS) gene based on 
its homology with the E. coli DNA sequence. The bacterial enzyme includes a 
nucleotide binding site for CTP. This binding site contains a number of amino acids 
that are conserved among all known bacterial CMP-SAS enzymes (See Stoughton et 
al, Biochem J. 15:397-402 (1999). The identity of the human cDNA as a CMP-SA 
synthetase gene was confirmed by the presence of significant homology within this 
binding motif: 
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bacterial sequence: IIAIIPARSGSKGL 
identity/homology + A+I AR GSKG+ 

human CDNA: LAALILARGGSKGI 



[0330] This human homologue commercially, publicly, or otherwise available 

for the purposes of this invention is cloned and expressed in insect cells. The 
nucleotide and amino acid sequences of human CMP SA synthetase are shown in 
Figures 29 and 30 respectively. The characterization of CMP-SA synthetase, and the 
use of CMP-SAS, as well as sialic acid synthetase (SAS), in engineering the sialic acid 
metabolic pathway is described in greater detail in Example 7, below. 

Example 4: Isolation and Inhibition of glucosam inidase 
[0331] It is recognized that insect cells could possess additional N- 

acetylglucosaminidase enzymes other than the enzyme responsible for generating low- 
mannose structures, so both recombinant DNA and biochemical approaches are 
implemented to isolate the target N-acetylglucosaminidase gene. PCR techniques are 
used to isolate fragments of N-acetylglucosaminidase genes by the same strategies used 
in isolating a/pAa-mannosidase cDNAs from Sf-9 cells (Jarvis et al. (1997) 
Glycobiology 7:1 13-127, Kawar et al. (1997) Glycobiology 7:433-443). Degenerate 
oligonucleotide primers are designed corresponding to regions of conserved amino acid 
sequence identified in all N-acetylglucosaminidases described thus far, from human to 
bacteria, including two lepidopteran insect enzymes (Zen et al. (1996) Insect Biochem. 
Mol. Biol. 26:435-444). These primers are used to amplify a fragment of the N- 
acetylglucosaminidase gene(s) from genomic DNA or cDNA of lepidopteran insect cell 
lines commercially, publicly, or otherwise available for the purposes of this invention. 
A putative N-acetylglucosaminidase gene fragment from Sf9 genomic DNA and from 
High Five™ cell (Invitrogen Corp., Carlsbad, CA, USA) cDNA has been identified 
(Figure 18). Similar techniques are used to isolate cDNAs from other insect cell lines 
of interest. The identification of cDNAs for the Sf9 or High Five™ N- 
acetylglucosaminidase facilitates the isolation of the gene in other insect cell lines. 
[0332] Figure 1 8 depicts PCR amplification of Sf9 genomic DNA (A) or High 

Five™cell cDNA (B) with degenerate primers corresponding to three different regions 
conserved within N-acetylglucosaminidases. These regions were designated 1, 2, and 



70 



3, from 5 to 3'. Lane 1 (sense primer 1 and antisense primer 2); Lanes 2 (sense primer 1 
and antisense primer 3); Lanes 3 (sense primer 2 and antisense primer 3). M (size 
standards with sizes indicated in Kbp). The results show that specific fragments of N- 
acetylglucosaminidase genes were amplified from both DNAs (lanes A2 and B3). The 
specificity of the reactions is indicated by the fact that different primer pairs produced 
different amplification products from different templates. The primer sequences 
utilized in amplifying the putative N-acetylglucosaminidase gene were as follows: 

Sense primer #1 : 5 , -T/C > T 3 I 5 C,A,C/T,T,G,G,C,A,C/T,A/T/C,T,I,G ) T,I,G,A-3' (SEQ ID 
NO:9) 

Sense primer #2: S'-GA.G/A^TJ.A/C/T^A.C/TJJJ.CCJ^^/CI.CA-S' (SEQ 
ID NO: 10) 

Antisense primer #2: S'-T.G.LC/CC^CCLL^G/AJ^^/G/A^J/A.C/T.T.C-S' 
(SEQIDNO:ll) 

Antisense primer #3: 5'-A,C/A/G,C/T,T 5 C J G/A,T,C,I,C,C,I,C,C,I,I,I,G/A,T,G-3' 
(SEQ ID NO: 12) 

[0333] The PCR amplified fragments are cloned and sequenced using the chain 

termination method (Sanger et al. (1977) Proc. Natl. Acad. Sci. USA 74:5463-5467). 
The results are used to design exact-match oligonucleotide primers to isolate an N- 
acetylglucosaminidase clone(s) from existing Sf9 and/or High Five™ lambda ZAPH 
cDNA libraries by sibling selection and PCR (Jarvis et al. (1997) Glycobiology 7:1 13- 
127, Kawar et al. (1997) Glycobiology 7:433-443). The library is consecutively split 
into sub-pools that score positive in PCR screens until a positive sub-pool of 
approximately 2,000 clones is obtained. These clones are then screened by plaque 
hybridization (Benton et al. (1977) Science 196:180-182) using the cloned PCR 
fragment, and positive clones are identified and plaque purified. The cDNA(s) are then 
excised in vivo as a pBluescript-based subclone in E. coli. 
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Isolation ofN-acetylglucosaminidases using biochemical techniques 
[0334] Since insect cells may have multiple N-acetylglucosaminidases, a 

biochemical purification approach is also used to broaden the search for the cDNA 
encoding the target enzyme. A polyclonal antiserum against a Manduca sexta N- 
acetylglucosaminidase (Koga et al. (1983) Manduca sexta Comparative Biochemistry 
and Physiology 74:515-520) is used to examine Sf9 and High Five™ cells for cross- 
reactivity. This antiserum is used to probe for the N-acetylglucosaminidase during 
biochemical isolation techniques. In addition, specific assays for N- 
acetylglucosaminidase described earlier are used to monitor enzyme activity in isolated 
biochemical fractions. 

[0335] The target N-acetylglucosaminidase is membrane bound, so it must be 

solubilized using a detergent such as Triton-X 100 prior to purification. Once 
solubilized, the enzyme is purified by a combination of gel filtration, ion exchange, and 
' affinity chromatography. For affinity chromatography, the affinants 6-aminohexyl thio- 
N-acetylglucosaminide (Chipowsky et al. (1973) Carbohydr. Res. 31:339-346) or BSA 
modified with thio-N-acetylglucosaminide (Lee et al. (1976) Biochemistry 15:3956- 
3963) is tried first. If necessary, 6-aminohexyl a-D-[2-(thio-2-amino-2-deoxy-b-D- 
glucosaminyl)-mannopyranodside or other thio-oligosaccharides are synthesized and 
used as affinants. Affinity matrices are prepared using commercially available 
products. 

[0336] Alternatively, the target enzyme is "anchored" to the membrane by a 

glycophosphoinositide. In such a case, a specific phospholipase C is used to release the 
active enzyme from the membrane, and the use of detergent for solubilization is 
avoided. 

[0337] The purity of the enzyme is examined with SDS-PAGE and mass 

spectroscopy, and the activity of the enzyme characterized. Once the enzyme is 
sufficiently purified, its amino-terminal region is sequenced by conventional Edman 
degradation techniques, available commercially. If N-terminal blockage is encountered, 
the purified protein are digested, peptides purified, and these peptides are used to obtain 
internal amino acid sequences. The resulting sequence information is used to design 
degenerate oligonucleotide primers that are used, in turn, to isolate cDNAs as described 
above. 
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Expression of glucosaminidase 
[0338] Isolated full-length cDNAs are sequenced, compared to other N- 

acetylglucosaminidase cDNAs, and expressed using known polyhedrin-based 
baculovirus vectors. The overexpressed proteins are purified, their biochemical 
activities and substrate specificities characterized, and new polyclonal antisera is 
produced to establish the subcellular locations of the enzymes in insect cells. The 
locations are optionally identified by using the antisera in conjunction with secretory 
pathway markers, including Golgi and endoplasmic reticulum specific dyes and GFP- 
tagged N-glycan processing enzymes commercially, publicly, or otherwise available for 
the purposes of this invention. Evaluation of the N-glycan structures on secreted 
glycoproteins from insect cells overexpressing the glucosaminidase gene demonstrates 
the involvement of this enzyme in N-glycan processing as opposed to lysosomal 
degradation, a common activity for other glucosaminidases. 

Example 5: Expression of the model glycoprotein transferrin 
[0339] The gene encoding human transferrin as described in Genbank accession 

No. S95936 is cloned into the baculovirus vector, expressed in multiple insect cell 
lines, and purified to homogeneity. Figure 26 shows SDS-PAGE of transferrin from 
insect cells (M=unpurified lysates, P=purified protein). Similar techniques are used to 
express and purify this glycoprotein in the target cell line(s) of interest following 
manipulation of the glycosyltransferase and CMP-SA production pathways. 
[0340] Once the transferrin is purified to homogeneity, the structures of the 

oligosaccharides which are N-linked at two sites of the transferrin are analyzed using 3- 
dimensional HPLC mapping techniques. Over 450 N-glycans have been mapped with 
this technique. For example, characterization of the N-linked oligosaccharides attached 
to the heavy chain of secreted and intracellular IgG is described. Confirmation of 
particular carbohydrate structures is provided by treating the oligosaccharides with 
glycosidases and re-eluting through the HPLC columns. Additional structural 
information on unknown oligosaccharides are obtained using mass spectrometry and 
NMR techniques previously used for analysis of IgG glyco forms (Hsu et al (1997) J. 
Biol Chem. 272:9062-9070). 
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in 



[0341] These analytical techniques allow the identification and quantification of 

N-glycans to determine if a fraction of these structures are sialylated oligosaccharides. 
Sialylation is confirmed by treating the purified N-glycan with sialidase from A. 
ureafaciens and measuring the release of sialic acid using HPAEC-PAD. 
[0342] The present invention now will be described more fully hereinafter with 

reference to the accompanying drawings, in which preferred embodiments of the 
invention are shown. This invention may, however, be embodied in many different 
forms and should not be construed as limited to the embodiments set forth herein; 
rather, these embodiments are provided so that this disclosure will be thorough and 
complete, and will fully convey the scope of the invention to those skilled in the art. 
Like numbers refer to like elements throughout. 

[0343] Many modifications and other embodiments of the invention will come 

to mind to one skilled in the art to which this invention pertains having the benefit of 
the teachings presented in the foregoing descriptions and the associated drawings. 
Therefore, it is to be understood that the invention is not to be limited to the specific 
embodiments disclosed and that modifications and other embodiments are intended to 
be included within the scope of the appended claims. Although specific terms are 
employed herein, they are used in a generic and descriptive sense only and not for 
purposes of limitation. 

Exam ple 6: Cloning, expression, and characteriza tion of the human sialic acid 
synthetase (SAS) gene and gene product. 

[0344] This example reports the cloning and characterization of a novel human 

gene having homology to the Escherichia coli sialic acid synthetase gene (neuB). This 
human gene is ubiquitously expressed and encodes a 40 kD enzyme which results in N- 
acetylneuraminic acid (Neu5Ac) and 2-keto-3-deoxy-D-g/vcero-D-gatacto-nononic 
acid (KDN) production in insect cells upon recombinant baculovirus infection. In vitro 
the human enzyme uses A^acetylmaimosamine-6-phosphate and mannose-6-phosphate 
as substrates to generate phosphorylated forms of Neu5Ac and KDN, respectively, but 
exhibits much higher activity toward the Neu5Ac phosphate product. 
[0345] In order to identify genes involved in sialic acid biosynthesis in 

eukaryotes, homology searches of a human expressed sequence tag (EST) database 
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were performed using the E. coli sialic acid synthetase gene. A cDNA of 
approximately 1 kb with a predicted open reading frame (ORF) of 359 amino acids was 
identified. Northern blot analysis indicated that the mRNA is ubiquitously expressed, 
and in vitro transcription and translation along with recombinant expression in insect 
cells demonstrated that the human sialic acid synthetase {SAS) gene encodes a 40 kD 
protein. SAS rescued an E. coli neuB mutant although less efficiently than neuB. 
Neu5 Ac production in insect culture supplemented with ManNAc further supported the 
role of SAS in sialic acid biosynthesis. In addition to Neu5Ac, a second sialic acid, 
KDN, was generated suggesting that the human enzyme has broad substrate specificity. 
The human enzyme (SAS), unlike its E. coli homologue, uses phosphorylated 
substrates to generate phosphorylated sialic acids and thus likely represents the 
previously described sialic acid-9-phosphate synthetase of mammalian cells (Watson et 
al., J. Biol. Chem. 241, 5627-5636 (1966)). 

Identification of a Human Sialic Acid Synthetase Gene 
[0346] The E. coli sialic acid synthetase gene (Annunziato et al., J. Bacteriol. 

1 77, 312-319 (1995)) was used to search the human EST database of Human Genome 
Sciences, Inc. (Rockville, MD). One EST with significant homology to the neuB gene 
was found in a human liver cDNA library and used to identify a full length cDNA 
(Figure 3 5 A) with an ORF homologous to the bacterial synthetase over most of its 
length. The putative synthetase consisted of 359 amino acids (SEQ ID NO:6) while the 
neuB gene product contained 346 amino acids (SEQ ID NO:8). Alignment of the 
human against the bacterial enzyme demonstrated that significant differences were 
found primarily in the N-terminus (Figure 35B). Overall, the two synthetases were 
found to be 36.1% identical and 56.1% similar at the amino acid level. 
[0347] The product of a cDNA amplification with a T7 promoter was expressed 

by in vitro transcription and translation using rabbit reticulocyte lysates. The 
generation of an -40 kD protein, consistent with a predicted molecular weight of 40.3 
kD, confirmed the existence of an ORF (Figure 36A, lane 2). The negative control, 
namely the vector without an insert, did not produce a protein product (Figure 36A, 
lane 1). Northern blot analysis was performed on poly-A+ RNA blots representing a 
selection of human tissues (Figure 36B). The full-length cDNA was radio-labeled and 
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used as probe. A band of expected size, -1 .3 kb, was observed in all tissues tested 
suggesting that the putative synthetase is ubiquitously expressed. 

Expression and Purification of Human Sialic Acid Synthetase 
[0348] SAS was inserted into baculovirus under the polh promoter using lacZ as 

a positive selection marker. After transfection and viral titering, the resulting virus 
(AcS AS) was used to infect Spodoptera frugiperda (Sf-9) cells followed by pulse 
labeling. An -40 kD band was observed in the Sf-9 lysates from cells infected by 
AcSAS (Figure 36A, lane 5) and not in the mock infected control (Figure 36A, lane 4). 
Furthermore, this band co-migrated with the protein produced in vitro. To verify SAS 
expression, the band was visualized in the non-nuclear fraction (Miyamoto et al., Mol 
Cell Biol 5, 2860-2865 (1985)) after electrophoretic transfer to a ProBlott™ 
membrane and Ponceau S staining (data not shown) and excised for amino acid 
sequencing. The five N-terminal amino acids were identical to the second through 
sixth amino acids of the predicted protein (data not shown). Interestingly, the initiator 
methionine was also removed from the purified recombinant E. coli sialic acid 
synthetase (Vann et al., 1997). 

In Vivo Activity of Human Sialic Acid Synthetase 
[0349] Covalent labeling of sialic acids with the fluorescent reagent 1 ,2- 

diamino-4,5-methylene dioxybenzene dihydrochloride (DMB) allows very specific and 
sensitive sialic acid detection (Hara et al., Anal Biochem. 1 79, 162-166 (1989); Manzi 
et al., Anal Biochem. 188, 20-32 (1990)). The DMB reaction products are identified 
after separation by reverse phase HPLC chromatography. Using this technique, sialic 
acid standards were measured in quantities as low as 50 ftnol (data not shown). Sialic 
acid levels of an insect cell line (Sf-9) and a mammalian cell line (Chinese hamster 
ovary, CHO) were compared (Table 2). The sialic acid content in cell lysates before 
and after filtration through a 10,000 MWCO membrane was determined by DMB 
labeling and HPLC separation. The native sialic acid levels in Sf-9 cells grown without 
fetal bovine serum (FBS) supplementation are substantially lower than the levels found 
in CHO cells (Table 2; Figure 37A). To ensure that the low sialic acid content was not 
due to the absence of serum, the sialic acid content of insect cells cultured in 10% FBS 
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was determined. Even with FBS addition, the NeuSAc content of Sf-9 cells is nearly an 
order of magnitude lower than the content of CHO cells (Table 2). The origin of the 
sialic acid detected in insect cells, whether natively produced or the result of 
contamination from the media, is not clear since even serum free insect cell media 
contains significant levels of sialic acid (data not shown). 



Table 2. Sialic Acid Content of CHO and Sf-9 Cell Lines 

KDN (ftnol/ng protein) NeuSAc (fmol/|ag protein) 

+ Filtration - Filtration + Filtration - Filtration 



Sf-9 + FBS - - 80 600 

CHO 70 100 900 4,200 

CHO and Sf-9 cells were grown to confluency in T-75 flasks. Cell lysates with and 

without 10,000 MWCO filtration were analyzed for sialic acid content following DMB 

derivatization and HPLC separation. Sialic acid levels have been normalized based on 

lysate protein content. Dashes indicate sialic acid was not detectable. 



[0350] The lack of large sialic acid pools in Sf-9 cells grown in serum-free 

media facilitated the detection of sialic acids produced by recombinant enzymes. In 
order to examine the production of sialic acids from cells infected with recombinant 
virus, Sf-9 cells were infected with AcSAS and a negative control virus, A35. The A35 
virus was generated by recombining a transfer vector without a gene inserted 
downstream of the polh promoter. Low levels of NeuSAc were observed in lysates 
from insect cells infected by either virus (Figure 37B) indicating additional NeuSAc 
was not produced following the expression of SAS. However, a significant new peak 
was seen in AcSAS lysates at 12.5 min. that was not observed in A35 negative control 
lysates (Figure 37B). Published chromatograms suggested the unknown early eluting 
peak could be 7V-glycolylneuraminic acid (NeuSGc) or KDN (Inoue et al., 1998). The 
elution time of the unknown peak was the same as DMB-derivatized KDN standard 
(Figure 37B) and co-chromatographed with authentic DMB-KDN (data not shown) 
confirming KDN generation in AcSAS infected Sf-9 cells. KDN was not detected in 
uninfected Sf-9 cells either with or without FBS supplementation (Table 2). 
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[0351] In a further attempt to demonstrate Neu5Ac synthetic functionality, the 

culture media was supplemented with ManNAc, the metabolic precursor of Neu5Ac. In 
addition to a DMB-KDN peak, a prominent peak eluting at 17.5 min. corresponding 
with that of the Neu5Ac standard was observed from the lysates of ManNAc 
supplemented Sf-9 cells infected with AcSAS (Figure 37C). Neu5Ac quantities were 
more than 100 times lower in the uninfected lysates and even less in A35 infected 
lysates (Table 2). 

[0352] Sialic acid levels were quantified in lysates of uninfected, A35 infected, 

and AcSAS infected Sf-9 cells grown in media with and without Man, mannosamine 
(ManN), or ManNAc supplementation (Table 3). In uninfected cells, Man feeding 
resulted in detection of KDN slightly above background, and ManNAc feeding 
marginally increased Neu5Ac levels in uninfected and A35 infected cells (Table 3). 
ManN supplementation had no effect on KDN levels but increased Neu5 Ac levels 
(Table 3). The most significant changes in sialic acid levels occurred with AcSAS 
infection. AcSAS infection of Sf-9 cells led to large increases in KDN levels with 
slight enhancements upon Man or ManNAc supplementation. Both AcSAS infection 
and ManNAc feeding were required to obtain substantial Neu5 Ac levels. 



Table 3. Sialic Acid Content of Sf-9 with Media Supplementation 



KDN (fmol/ue protein) Neu5Ac (fmol/ug protein) 



Feeding: 


None 


Man 


ManN 


ManNAc 


None 


Man 


ManN 


ManNAc 


No Infection 




20 






30 


20 


60 


140 


A35 










80 


80 


100 


120 


AcSAS 


5,300 


7,600 


5,200 


6,300 


50 


40 


200 


27,000 



Uninfected, A35 infected, and AcSAS infected Sf-9 cells were grown in unsupplemented media 
and media that was supplemented with 10 mM Man, ManN, or ManNAc. Cell lysates were 
analyzed for KDN and Neu5Ac content using DMB derivatization and HPLC separation. Sialic 
acid levels have been normalized based on lysate protein content. Dashes indicate sialic acid 
was not detectable. 



[0353] The presence of KDN and Neu5 Ac in AcSAS lysates has been 

confirmed by high-performance anion-exchange chromatography (HPAEC) with a 
pulsed amperometric detector (Figure 37D). When culture media is supplemented with 
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ManNAc, peaks with elution times corresponding to authentic KDN and NeuSAc 
standards are seen in AcSAS infected lysates that are absent in A35 infected lysates. 
NeuSAc aldolase has been demonstrated previously to break Neu5 Ac into ManNAc and 
pyruvic acid (Comb and Roseman, 7. Biol Chem. 235, 2529-2537 (I960)) and KDN 
into Man and pyruvic acid (Nadano et al., J. Biol Chem, 261, 1 1550-1 1557 (1986)). 
KDN and NeuSAc disappear from the AcSAS lysates after aldolase treatment (Figure 
37D). A similar disappearance of the sialic acid peaks following aldolase treatment 
was observed using DMB-labeling and HPLC analysis (data not shown). 

In Vitro Activity of Human Sialic Acid Synthetase 
[0354] The mammalian pathway for NeuSAc synthesis uses a phosphate 

intermediate (Jourdian et al., J. Biol Chem, 239, PC2714-PC2716 (1964); Kundig et 
al., J. Biol Chem, 241, 5619-5626 (1966); Watson et al., J. Biol Chem, 241, 5627- 
5636 (1966)) while the E. coli pathway directly converts ManNAc and PEP to NeuSAc 
(Vann et al., Glycobiology 7, 697-701 (1997)). In order to determine which substrates 
are used by the human enzyme, in vitro assays were performed using lysates of infected 
Sf-9 cells and protein purified from the prokaryotic expression system. Lysates or 
purified protein plus PEP and MnCl 2 (Angata et al., 7. Biol Chem. 274, 22949-22956 
(1999)) were incubated with Man, mannose-6-phosphate (Man-6-P), ManNAc, or 
ManNAc-6-P followed by DMB labeling and HPLC analysis. 
[0355] AcSAS infected cell lysates incubated with ManNAc-6-P and PEP 

produced a peak eluting at 5.5 min (Figure 38A) consistent with phosphorylated sugars. 
In previous studies, phosphorylated KDN was detected as DMB-KDN after alkaline 
phosphatase (AP) treatment and DMB derivatization (Angata et al., J. Biol Chem. 274, 
22949-22956 (1999)). Similarly, the peak eluting at 5.5 min. was exchanged for one 
that eluted at the same time as authentic NeuSAc following AP treatment (Figure 38A). 
Likewise, an early eluting peak from the incubation mixture containing Man-6-P 
yielded a KDN peak after AP treatment (Figure 38B). No sialic acid products were 
detected when A3 5 infected cell lysates were used in the equivalent assays or when 
Man or ManNAc were used as substrates (data not shown). 

[0356] Assays were performed by incubating lysates with different substrate 

solution concentrations of Man-6-P and ManNAc-6-P in order to evaluate substrate 
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preference. After incubation for a fixed time period, the samples were treated with AP, 
and DMB derivatives of Neu5Ac and KDN were quantified and compared (Table 4). 
When equimolar amounts of substrates are used, Neu5Ac production is significantly 
favored over KDN especially at higher equimolar concentrations (10 and 20 mM) of the 
two substrates. Only when the substrate concentration of ManNAc-6-P is substantially 
lower than the Man-6-P levels are production levels of the two sialic acids comparable. 
When the ManNAc-6-P concentration is 1 mM and the Man-6-P level is 20 mM, the 
Neu5Ac:KDN production ratio approaches unity. Therefore, the enzyme prefers 
ManNAc-6-P over Man-6-P in the production of phosphorylated forms of Neu5Ac and 
KDN, respectively. 

Table 4. Competitive Formation of Neu5Ac and KDN 

Concentration in Substrate Solution (mM) Final Concentration Neu5Ac/KDN 

(pmol/ul) Ratio 
Man-6-P ManNAc-6-P KDN Neu5Ac 



1 


1 


8 


33 


4.2 


5 


1 


19 


47 


2.5 


10 


1 


33 


53 


1.6 


20 


1 


56 


60 


1.1 


5 


5 


14 


190 


14 


10 


10 


18 


440 


24 


20 


20 


16 


820 


51 


20 


5 


40 


300 


7.6 


20 


10 


18 


470 


25 



Lysates from AcSAS infected Sf-9 cells were incubated with substrate solutions 
containing the indicated concentrations of Man-6-P and ManNAc-6-P. After incubation 
and AP treatment, samples were analyzed for KDN and Neu5 Ac content using DMB 
derivatization and HPLC separation. Neu5Ac and KDN concentrations of the final 
solution (50 ul) and the Neu5Ac/KDN ratio are reported. 
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Discussion of Human Sialic Acid Synthetase Characterization 
[0357] We have identified the sequence of a human sialic acid phosphate 

synthetase gene, SAS, whose protein product condenses ManNAc-6-P or Man-6-P with 
PEP to form Neu5 Ac and KDN phosphates, respectively. To our knowledge, this is the 
first report of the cloning of a eukaryotic sialic acid phosphate synthetase gene. Despite 
the importance of sialic acids in many biological recognition phenomena, sialic acid 
phosphate synthetase genes have not been cloned because the enzymes they encode are 
unstable and difficult to purify (Watson et al., J. Biol. Chem. 241, 5627-5636 (1966); 
Angata et al., J. Biol. Chem. 21 A, 22949-22956 (1999)). Even the E. coli sialic acid 
synthetase enzyme, whose sequence is known, has low specific activity and is unstable 
(Vann et al., Glycobiology 7, 697-701 (1 997)). 

[0358] Consequently, a bioinformatics approach based on the E. coli synthetase 

sequence was used to identify a putative human gene 36% identical and 56% similar to 
neuB. In vitro transcription and translation verified an open reading frame which 
encoded a 359 amino acid protein. In addition, Northern blots revealed ubiquitous 
transcription of the human synthetase gene in a selection of human tissues. The wide 
distribution of SAS mRNA is consistent with the detection of sialic acids in many 
different mammalian tissues (Inoue and Inoue, Sialobiology and Other Novel Forms of 
Glycosylation (Osaka, Japan: Gakushin Publishing) pp.57-67 (1999)). 
[0359] Using the baculovirus expression system, the 40 kD sialic acid 

phosphate synthetase enzyme, SAS, was expressed in cells. The use of Sf-9 cells which 
have little if any native sialic acid greatly facilitated the detection of sialic acids and the 
characterization of SAS. However, Neu5Ac was observed only when insect cells were 
infected with AcSAS and the cell culture media was supplemented with ManNAc, a 
sialic acid precursor. This ManNAc feeding requirement indicates that Sf-9 cells may 
lack sizeable ManNAc pools and synthetic pathways. 

[0360] SAS was identified based on homology with neuB whose enzyme 

product directly forms Neu5 Ac from ManNAc and PEP (Vann et al., Glycobiology 7, 
697-701 (1997)). Furthermore, insect cells produce Neu5Ac following recombinant 
SAS expression and ManNAc supplementation. However, mammalian cells are known 
only to produce Neu5Ac from ManNAc through a three-step pathway with 
phosphorylated intermediates. Therefore, in vitro assays were performed to determine 
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the substrate specificity of SAS. Both AcSAS infected insect cell lysates and protein 
purified from the prokaryotic expression system were assayed using ManNAc and 
ManNAc-6-P as possible substrates. A rapidly eluting DMB derivatized product, 
typical of a phosphorylated sialic acid, was observed only when ManNAc-6-P was used 
as the substrate. Furthermore, this peak disappears with the appearance of an 
unsubstituted DMB-Neu5 Ac peak following AP treatment. SAS therefore condenses 
PEP and ManNAc-6-P to form a Neu5Ac phosphate product. Although the exact 
position of the phosphorylated carbon on the product has not yet been specified, SAS is 
likely the sialic acid phosphate synthetase enzyme of the previously described three- 
step mammalian pathway (Kundig et al., J. Biol. Chem. 241, 5619-5626 (1966); 
Watson et al., J. Biol. Chem. 241, 5627-5636 (1966); Jourdian et al., J. Biol. Chem. 
239, PC2714-PC2716 (1964)). Despite little if any native pools of sialic acids, Sf-9 
cells natively possess the ability to complete the three-step mammalian pathway when 
only the sialic acid phosphate synthetase gene is provided. Sf-9 cells have been shown 
to have substantial ManNAc kinase ability (Effertz et al., J. Biol. Chem. 274, 28771- 
28778 (1999)), and phosphatase activity has also been detected in insect cells 
(Sukhanova et al., Genetika 34, 1239-1242 (1998)). 

[0361] The capacity to produce sialic acids in Sf-9 cells following AcSAS 

infection and ManNAc supplementation at levels even higher than those seen in a 
mammalian cell lines such as CHO may help overcome a major limitation of the 
baculovirus expression system. N-glycans of recombinant glycoproteins produced in 
insect cells lack significant levels of terminal sialic acid residues (Jarvis and Finn, 
Virology 212, 500-511 (1995); Ogonah et al., Bio/Technology 14, 197-202 (1996)). 
The lack of sialylation on human thyrotropin produced by the baculovirus expression 
system resulted in rapid in vivo thyrotropin clearance as compared to thyrotropin 
produced by a mammalian system (Grossmann et al., Endocrinology 138, 92-100 
(1997)). Generation of significant sialic acid pools along with expression of other 
genes such as sialyltransferases may lead to production of significant levels of 
sialylated glycoproteins in insect cells. 

[0362] Another interesting observation was the occurrence of a second DMB 

reactive peak in AcSAS infected Sf-9 lysates. This peak has been identified as KDN, a 
deaminated Neu5Ac. We subsequently demonstrated that the SAS enzyme generates 
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KDN phosphate from Man-6-P and PEP in vitro. While Neu5 Ac production in insect 
cells requires both AcSAS infection and ManNAc supplementation, only AcSAS 
infection is necessary for KDN synthesis. Therefore, significant substrate pools for the 
generation of KDN already exist in insect cells or are present in the media. In addition, 
mannose feeding increased KDN production even further. Interestingly, Man feeding 
of the uninfected insect cells increased KDN levels above background, and ManNAc 
feeding also led to higher Neu5 Ac levels in uninfected cells. Therefore, insect cells 
may possess limited native sialic acid synthetic ability. Similar substrate 
supplementation results have been reported in mammalian cells, as cultivation in Man- 
rich or ManNAc-rich media enhanced the synthesis of native intracellular KDN and 
NeuSAc, respectively (Angata et al., Biochem. Biophys. Res. Commun. 261, 326-331 
(1999)). 

[0363] This study is the first report of a eukaryotic gene encoding any enzyme 

with KDN synthetic ability. Recently, KDN enzymatic activity has been characterized 
in trout testis, a tissue high in KDN content. KDN is synthesized from Man in trout 
through a three-step pathway involving a synthetase with a Man-6-P substrate (Angata 
et al., J, Biol Chem. 274, 22949-22956 (1999)). However, the fish synthetase enzyme, 
partially purified from trout testis, was approximately 80 kD as compared to the human 
enzyme of 40 kD. Furthermore, KDN and Neu5Ac phosphate synthesis in trout were 
likely catalyzed by two separate synthetase activities (Angata et al., J. Biol Chem. 274, 
22949-22956 (1999)) while the current study indicates that both products were 
generated from a single human enzyme with broad substrate specificity. 
[0364] NeuSAc, usually bound to glycoconjugates, is the predominant sialic 

acid found in mammalian tissue, but KDN, primarily found free in the ethanol soluble 
fractions, has also been detected all human tissues examined so far (Inoue and Inoue, 
Sialobiology and Other Novel Forms of Glycosylation (Osaka, Japan: Gakushin 
Publishing, pp.57-67 (1999)). The ratio of NeuSAc to KDN is on the order of 100:1 in 
blood cells and ovaries (Inoue et al., 1998), although this ratio may change during 
development and cancer. The levels of free KDN in newborn fetal cord red blood cells 
are higher than those of maternal red blood cells (Inoue et al., J. Biol. Chem. 273, 
27199-27204 (1998)). Furthermore, a 4.2 fold increase in the ratio of free KDN to free 
NeuSAc was observed in ovarian tumor cells as compared to normal cells, and the ratio 
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appears to increase with the extent of invasion or malignancy for ovarian 
adenocarcinomas (Inoue et al., J. Biol Chem. 273, 27199-27204 (1998)). 
[0365] Because the KDN/Neu5Ac ratio has biological significance, we 

performed competitive in vitro assays with insect cell lysates using both ManNAc-6-P 
and Man-6-P as substrates. SAS demonstrated a preference for phosphorylated 
Neu5 Ac over phosphorylated KDN synthesis in vitro, although the concentrations of 
the particular substrates relative to the enzyme level altered this production ratio. Thus 
changes in the ratios of free KDN to NeuSAc observed in different developmental 
states and cancer tissue may reflect variability either in the levels of specific substrates 
or the amount of active enzyme present in vivo. The identification of the SAS genetic 
sequence and characterization of the enzyme it encodes should help further our 
understanding of sialic acid biosynthesis as well as the roles sialic acids play in 
development and disease states. 

[0366] In Figure 39 the production of sialylated nucleotides in SF-9 insect cells 

following infection with human CMP-SA synthetase and SA synthetase containing 
baculoviruses is demonstrated. Sf-9 cells were grown in six well plates and infected 
with baculovirus containing CMP-SA synthase and supplemented with 10 mM 
ManNAc ("CMP" line), baculovirus containing CMP-SA synthase and SA synthase 
plus 10 mM ManNAc supplementation ("CMP+SA" line), or no baculovirus and no 
ManNAc supplementation ("SF9" line). The nucleotide sugars from lysed cells were 
extracted with 75% ethanol, dried, resuspended in water, and filtered through a 10,000 
molecular weight cut-off membrane. Samples were then separated on a Dionex 
Carbopac PA-1 column using a Shimadzu VP series HPLC. Nucleotide sugars were 
detected based upon their absorbance at 280 nm, and CMP sialic acid standards were 
shown to elute at approximately 7 minutes. These results demonstrate the ability to 
produce the desired oligosaccharide products in insect cells via introduction and 
expression of sialyl transferase enzymes. 




84 



Materials and Method of Example 6 

Gene Characterization 
[0367] The E. coli neuB coding sequence was used to query the Human 

Genome Sciences (Rockville, MD) cDNA database with BLAST software. One EST 
clone, HMKAK61, from a human (liver) cDNA library demonstrated significant 
homology to neuB and was chosen for further characterization. The tissue distribution 
profile was determined by Northern blot hybridization. Briefly, the cDNA was radio- 
labeled with [ 32 P]-dCTP using a RediPrime™n kit (Amersham/Pharmacia Biotech, 
Piscataway, NJ) following the manufacturer's directions. Multiple tissue Northern 
blots containing poly-A+ RNA (Clontech, Palo Alto, CA) were pre-hybridized at 42°C 
for 4 hours and then hybridized overnight with radio-labeled probe at lxlO 6 CPM/ml. 
The blots were sequentially washed twice for 15 min. at 42°C and once for 20 min. at 
65°C in 0.1 X SSC, 0.1% SDS and subsequently autoradiographed. 

Baculovirus Cloning and Protein Expression 
[0368] The full length ORF was amplified by PCR using the following primers. 

The forward primer, 5 s - 

TGTAATACGACTCACTATAGGGC GG^tTCCGCCATCATGCCGCTGGAGCTGG 
AGC (SEQ ID NO: 13) contained a synthetic T7 promoter sequence (underlined), a 
BamHI site (italics), a KOZAK sequence (bold), and sequence corresponding to the 
first six codons of SAS. The minus strand primer, 5'- 

GTACGGZ4CC TTATTA AGACTTGATTTTTTTGCC (SEQ ID NO: 14), contained an 
Asp 718 site (italics), two in-frame stop codons (underlined), and sequences 
representing the last six codons of SAS. 

[0369] After amplification, the PCR product was digested with BamHI and Asp 

718 (Roche, Indianapolis, IN) and the resulting fragment cloned into the corresponding 
sites of the baculovirus transfer vector, pA2. Following DNA sequence confirmation, 
the plasmid (pA2-SAS) was transfected into Sf-9 cells to generate the recombinant 
baculovirus AcSAS as previously described (Coleman et al., Gene 190, 163-171 
(1997)). Amplified virus was used to infect cells, and the gene product was radio- 
labeled with [ 35 S]-Met and [ 35 S]-Cys. Bands corresponding to the gene product were 
visualized by SDS-PAGE and autoradiography. Alternatively, the PCR product was 
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used as a template for in vitro transcription and translation using rabbit reticulocyte 
lysate (Promega, Madison, WI) in the presence of [ 35 S]-Met. Translation products were 
resolved by SDS-PAGE and visualized by autoradiography. 

10370] For protein production, Sf-9 cells were seeded in serum-free media at a 

density of lxl 0 6 cells/ml in spinner flasks and infected at a multiplicity of infection of 
1-2 with the recombinant virus. A detergent fractionation procedure was employed 
(Miyamoto et al., Mol. Cell. Biol. 5, 2860-2865 (1985)) to separate nuclear from non- 
nuclear fractions. Protein was resolved by SDS-PAGE, transferred to a ProBlott™ 
membrane (ABI, Foster City, CA), and visualized by Ponceau S staining. A prominent 
band at the expected MW of -40 kD was visible and excised for protein 
microsequencing using an ABI-494 sequencer (PE Biosystems, Foster City, CA).- 

Neu5Ac/KDN Detection 
[0371] Sialic acid was measured by the procedure of Hara et al. (Anal. Biochem. 

179, 162-166 (1989). Ten microliters of sample were treated with 200 ul DMB (Sigma 
Chemicals, St. Louis, MO) solution (7.0 mM DMB in 1.4 M acetic acid, 0.75 M p- 
mercaptoethanol, and 18 mM sodium hydrosulfite) at 50°C for 2.5 hrs, from which 10 
^il was used for HPLC analysis on a Shimadzu (Columbia, MD) VP series HPLC using 
a Waters (Milford, MA) Spherisorb 5 urn ODS2 column. Peaks were detected using a 
Shimadzu RF-10AXL fluorescence detector with 448 nm emission and 373 nm 
excitation wavelengths. The mobile phase was an acetonitrile, methanol, and water 
mixture (9:7:84, v/v) with a flow rate of 0.7 ml/min. Response factors of Neu5Ac and 
KDN were established with authentic standards based on peak areas for quantifying 
sample sialic acid levels. Sialic acid content was normalized based on protein content 
measured with the Pierce (Rockford, IL) BC A assay kit and a Molecular Devices 
(Sunnyvale, CA) microplate reader. 

Cell Culture and Sialic Acid Quantification 
[0372] Sf-9 (ATCC, Manassas, VA) cells were grown in Ex-Cell™ 405 media 

(JRH Bioscience, Lenexa, KS) with and without 10% FBS at 27°C. CHO-K1 cells 
(ATCC, Manassas, VA) were cultured at 37°C in a humidified atmosphere with 5% 
C0 2 in Dulbecco's Modified Eagle Medium (Life Technologies, Rockville, MD) 
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supplemented with 10% FBS, 100 U/ml penicillin, 100 ug/ml streptomycin, 100 yiM 
MEM essential amino acids, and 4 mM L-glutamine (Life Technologies, Rockville, 
MD). Cells were grown to confluency in T-75 flasks, washed twice with PBS, and 
lysed in 0.05 M bicine, pH 8.5, with 1 mM DTT (Vann et al., Glycobiology 7, 697-701 
(1997)) using a Tekmar Sonic Disruptor (Cincinnati, OH). For determination of sialic 
acid content, 10 \i\ of lysates with and without 10,000 MWCO microfiltration 
(Millipore, Bedford, MA) were analyzed by DMB derivatization as described above. 
[0373] Sugar substrate feeding was studied by plating approximately 10 6 Sf-9 

cells on each well of a six well plate. Media was replaced with 2 ml fresh media 
supplemented with 10 mM sterile-filtered Man, ManN, or ManNAc. Cells were left 
uninfected or infected with 20 fj.1 of the appropriate (A3 5 or AcSAS) amplified 
baculovirus stock. Cells were harvested at 80 hours post infection by separating the 
pellet from the media by centrifugation and washing twice with PBS. Cells were lysed 
and analyzed for sialic acid content as described above. 

In vitro Activity 

[0374] In vitro activity assays were based on the procedure of Angara et al. (J. 

Biol. Chem. 274, 22949-22956 (1999)). Lysates were prepared from A35 and AcSAS 
infected and uninfected Sf-9 cells cultured in T-75 flasks with and without 10 mM 
ManNAc supplementation. After washing twice with PBS, cells were lysed on ice with 
25 strokes of a tight-fitting Dounce homogenizer (Wheaton, Millville, NJ) in 2.5 ml 
lysis buffer [50mM HEPES pH = 7.0 with 1 mM DTT, leupeptin (1 ug/ml), antipain 
(0.5 ug/ml), benzamidine-HCl (15.6 ug/ml), aprotinin (0.5 ug/ml), chymostatin (0.5 
ug/ml), and 1 mM phenylmethylsulfonylfluoride]. 5 ul of substrate solution was 
incubated with either 20 ul insect cell lysate (30 min.) or purified E. coli protein (60 
min.) at 37°C. The substrate solution contained 10 mM MnCl 2 , 20 mM PEP, and either 
5 mM ManNAc-6-P or 25 mM Man-6-P (Sigma, St. Louis, MO). ManNAc-6-P was 
prepared by acid hydrolysis of meningococcal Group A polysaccharide. The 
polysaccharide (15.5 mg) in 5.8 ml water was mixed with 770 mg of Dowex 50 H+ and 
heated for 1 hr. at 100°C. The filtered hydrolysate was dried in vacuo and the residue 
dissolved to give a solution of 50 mM ManNAc-6-P and stored frozen. Substrate 
solutions containing 25 mM Man and ManNAc were also used. Boiled samples were 



87 



used as negative controls. Following incubation, all samples were boiled 3 min., 
centrifuged for 10 min. at 12,000g, and split into two 10 |al aliquots. One aliquot was 
treated with 9 units of calf intestine alkaline phosphatase (Roche, Indianapolis, IN) 
along with 3 jalof accompanying buffer while the other aliquot was diluted with water 
and buffer. AP treated aliquots were incubated 4 hrs. at 37°C, and 10 |al of both AP 
treated and untreated samples were reacted with DMB as described above. 2 |al of the 
samples incubated with insect lysates and 10 (al of the samples incubated with bacterial 
protein were injected onto the HPLC for sialic acid analysis as described above. 
[0375] For substrate competition experiments, Man-6-P and ManNAc-6-P 

concentrations in the substrate solution were varied from 1 to 20 mM. In vitro assays 
were run with Sf-9 lysates as described above. Samples were treated with 7 |al buffer 
and 18 units of AP, incubated for 4 hrs. at 37°C, and analyzed for sialic acid content. 
Samples containing more than 1 mM ManNAc-6-P in the substrate solution produced 
high levels of sialic acid and were diluted 1 :5 before injection to avoid fluorescence 
detector signal saturation. 

Analysis with Aldolase Using HPAEC 
[0376] Sf-9 cells were grown in T-75 flasks and then infected with A3 5 or 

AcSAS or left uninfected in the presence or absence of 10 mM ManNAc. After 80 hrs., 
cells were washed twice in PBS and sonicated. Aliquots (200 \A ) were filtered through 
10,000 MWCO membranes, and 50 \x\ samples were treated with 12.5 pi aldolase 
solution [0.0055 U aldolase (ICN, Costa Mesa, CA), 1.4 mM NADH (Sigma, St. Louis, 
MO), 0.5 M HEPES pH 7.5, 0.7 U lactate dehydrogenase (Roche, Indianapolis, IN)] or 
left untreated and incubated at 37°C for one hour (Lilley et al., 1992). Samples were 
analyzed by HPAEC with a Dionex (Sunnyvale, CA) BioLC system using a pulsed 
amperometric detector (PAD-II) on a Carbopac PA-1 column. The initial elution 
composition was 50% A (200 mM NaOH), 45% B (water), and 5% C (1M NaOAc, 200 
mM NaOH) with a linear gradient to 50% A, 25% B, and 25% C at 20 min. A 6 min. 
50% A and 50 % C washing followed. Samples were normalized based on protein 
content by dilution with water, and 20 (il of each sample were analyzed. Ten \i\ of each 
sample were also derivatized with DMB and analyzed by HPLC as described above to 
confirm the elimination of sialic acids by aldolase treatment. 
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Exam ple 7: Characterization Of CMP-SAS And Its U se With SAS In 
Engineerine The Sialic Acid Metabolic Pathway 

[0377] Recombinant glycoproteins are increasingly used as therapeutic agents, 

and post-translational modifications to these proteins can significantly affect their 
properties and value. N-glycosylation is a particularly important modification that can 
affect solubility, biological activity, and in vivo circulatory half-life 1 . Mammalian cell 
lines produce glycoproteins with complex-type glycan patterns typically terminating in 
sialic acid residues. While insect cells N-glycosylate secretory and membrane proteins, 
the final N-glycosylation pattern includes mostly high mannose or paucimannosidic 
(low mannose) structures 2 ' 3 . Some oligosaccharides terminating in N- 
acetylglucosamine or galactose have also been observed 2 ' 3 . Expression of recombinant 
galactosyltransferases has increased the relative amounts of galactose terminating N- 
glycans 4 " 6 . The differences between mammalian and insect glycosylation can have a 
significant impact on a protein's characteristics. For example, the in vitro activity of 
recombinant human thyrotropin expressed in insect cells was five times higher than that 
produced by Chinese hamster ovary cells (CHO), yet the in vivo activity of CHO 
thyrotropin was higher in mice 7 . The reason for this difference in activity was thought 
to be a more rapid clearance of the insect produced thyrotropin due to the absence of 
terminal sialic acid residues 1 . 

[0378] The sialylation of glycoproteins requires the metabolic generation of the 

sugar nucleotide cytidine monosphospho-sialic acid (CMP-SA) followed by the transfer 
of the sialic acid to an acceptor oligosaccharide in the Golgi apparatus by 
sialyltransferases. Previous studies indicate that insect cells have undetectable levels of 
the most common sialic acid nucleotide, CMP-A^acetylneuraminic acid (CMP- 
NeuS Ac) 8 as well as negligible levels of its metabolic precursor, NeuSAc 9 . Since 
CMP-Neu5Ac and NeuSAc are not easily incorporated into cells 10 , alternative strategies 
should be considered for generating sufficient NeuSAc and CMP-NeuS Ac for 
sialylation in insect cells. One approach is to engineer the necessary enzymatic 
pathways for CMP-SA synthesis. Unlike NeuSAc or CMP-NeuS Ac, the precursor to 
NeuSAc, 7V-acetylmannosamine (ManNAc) 10 , and its analogs 11,12 are readily 
incorporated into cells. In fact, feeding of ManNAc in concert with the expression of 
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the recently cloned sialic acid phosphate synthase gene generated large intracellular 
pools of NeuSAc in insect cells 9 . In the case of mammalian cells, NeuSAc is 
subsequently activated to the nucleotide sugar CMP-Neu5 Ac by CMP-sialic acid 
synthase 13 . While the activity of CMP-SAS has been described since 1962 14 , the gene 
has only recently been cloned in E. coli x$ and mouse 16 . 

[0379] Using homology searches based on the E. coli sequence (neuA), the 

novel human Cmp-Sas gene of the present invention was identified and cloned it into a 
transfer vector to produce the recombinant baculovirus (AcCMP-SAS). In order to 
determine if insect cells could be modified to generate CMP-SA pools, Sf-9 cells were 
co-infected with AcCMP-SAS and the baculovirus containing the recombinant human 
sialic acid phosphate synthase (AcS AS) of the present invention in the presence of 
ManNAc. The result was the production of large pools of intracellular CMP-NeuSAc, 
overcoming a major limitation to the sialylation of glycoproteins in insect cells. 
Interestingly, another sugar nucleotide, CMP-2-keto-3-deoxy-D-g/yc^ra-D-ga/ac/o- 
nononic acid (CMP-KDN), was generated in insect cells following infection with 
AcSAS and AcCMP-SAS when not supplemented by ManNAc. The ability to generate 
either CMP-NeuS Ac or CMP-KDN in separate cell cultures presents the potential for 
generating glycoproteins with different sialic acid termini in insect cells. 

Experimental Protocol of Example 7 

Gene Identification, Characterization, Cloning, and Expression 
[03801 The E. coli neuA coding sequence was used to query the Human 

Genome Sciences, Inc., (Rockville, MD) human cDNA database with BLAST software. 
One EST clone from a human prostate cell line demonstrated significant homology to 
neuA and was further characterized. The procedures used for Northern blotting, in vitro 
transcription and translation, and baculovirus cloning were the same as those described 
for work with For PCR amplification, the forward primer, 5'- 

TGT AAT ACG ACTC ACT AT AGGGC GGA TCCGCC ATC ATGG ACTCGGTGG AG A 
AGG, contained a synthetic T7 promoter sequence (underlined), a BamHI site (italics), 
a KOZAK sequence (bold), and a sequence corresponding to the first six codons of 
Cmp-Sas. The reverse primer, 5'- 

GTACGGr^CCTTACTATTTTTGGCATGAATTATTAACTTTTTCC, contained an 
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Asp 718 site (italics), two in-frame stop codons (bold), and sequence representing the 
last six codons of Cmp-Sas. 

Enzyme Localization 

[0381] Protein localization was determined with the Pierce NE-PER™ kit and 

modifications of a previously published procedure 19 . T-175 flasks were plated with 
16xl0 6 Sf-9 cells, and the media was replaced. Two flasks of cells were infected with 
A35, AcSAS, or AcCMP-SAS amplified viruses for approximately 60 h. Cells were 
washed twice with PBS and incubated in 3 ml of the described hypotonic buffer 19 . 
After centrifugation, 400 ul of stripping solution (10 mM Tris HC1 pH 7.3, 1% NP-40, 
0.5% deoxycholate, 10 mM NaCl, 1 .5 mM MgCl 2 ) was added to the pellet. The 
supernatant was removed and the soluble nuclear fraction taken after incubation with 
200 ul of the described lysis buffer 19 . The protein content of all fractions was 
determined using the Pierce (Rockford, IL) BCA assay kit with a Molecular Devices 
(Sunnyvale, CA) 96-well plate reader. Total protein from each fraction were analyzed 
by 12.5% SDS-PAGE. Samples were transferred to a PVDF membrane, stained with 
Ponceau S, appropriate bands excised, and submitted for Af-terminal protein sequencing 
using an ABI-494 sequencer (PE Biosystems, Foster City, CA). 

CMP-Sialic Acid Synthase Assay 
[0382] To assay CMP-sialic acid synthetic ability, CHO-K1 cells (ATCC, 

Manassas, VA) were grown to confluence in T-75 flasks at 37°C in a humidified 
atmosphere with 5% C0 2 in Dulbecco's Modified Eagle medium (Life Technologies, 
Rockville, MD) supplemented with 10% FBS, 100 U/ml penicillin, 100 ug/ml 
streptomycin, 100 uM MEM essential amino acids, and 4 mM L-glutamine (Life 
Technologies, Rockville, MD). After washing with PBS, 1 ml of suspension solution 
(02.M Tris, 0.1 5M NaCl, 1% NP-40 pH = 9.0) 27 was added. Sf-9 cells were plated in 
6-well dishes with fresh Ex-Cell 405 media (JRH Biosciences, Lanexa, KS) and 
infected with the A35 control virus or AcCMP-SAS virus or left uninfected for 60 h. 
The cells were washed with PBS, and the contents of two wells were harvested with 
400 ul of the resuspension solution in each well. The resulting mixtures were 
centrifuged 15 min at 14,000 RPM, and the supernatant was used for enzyme assays. 
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The protein content of each sample was determined as previously described. The 
activity assays were modified from the TBA assay used to measure CMP-SA 
formation 17 . Twenty [i\ of lysates were incubated with 100 |al of substrate solution 17 at 
37°C for 40 min. Forty pi of 1.6 M NaBH 4 was used to reduce free NeuSAc, and the 
mixture was then treated with 40 jal of 20 N H3PO4. Released NeuSAc was oxidized 
with 0.1 ml 25 mM periodic acid for 30 min at 37°C then treated with 0.1 ml of 62 mM 
sodium bisulfite. 1 .0 ml of 0. 1 M TBA was added, and the mixture was placed in a 
boiling water bath for 7.5 minutes. The tubes were placed on ice and 2 ml of DMSO 
with 5% concentrated HC1 was added. The absorbance at 552 nm was taken using a 
Spectronic 20 spectrophotometer (Bausch and Lomb, Rochester, NY), and the amount 
of CMP-Neu5Ac formed calculated by NeuSAc standards. 

CMP-Sialic Acid Levels 
[0383] Sf-9 cells were grown in 6-well plates and infected with combinations of 

A35, AcSAS, and AcCMP-SAS viruses along with 10 mM ManNAc feeding for 
approximately 90 hr. CHO cells grown according to the culture conditions described 
above were grown in T-75 flasks with and without 10 mM ManNAc supplementation 
for 48 hr. All cells were washed with PBS and aliquots taken for protein quantification. 
Cells were resuspended in 75% ethanol and sonicated on ice for 30 s. Lysates were 
centrifuged, frozen in liquid nitrogen, and lyophilized. Samples were resuspended in 
120 \i\ of 40 mM phosphate buffer, pH 9.2, to stabilize CMP-sialic acids. Samples 
were centrifuged again and the supernatant filtered through 10,000 molecular weight 
cut-off membranes. CMP-sialic acids were detected using HPAEC (Tomiya, in 
preparation) and quantified versus the response curves of standard CMP-Neu5Ac 
(Sigma, St. Louis, MO) and CMP-KDN (Yasuhiro Kajihara, Yokohama City 
University, Japan). 

Results of Example 7 

Gene Identification. 
[0384] The human CMP-sialic acid synthase gene was identified using 

homology searches based on the known bacterial CMP-sialic acid synthase gene 15 . 
Over the entire sequences, the putative human gene is 23% identical and 47% similar to 
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the bacterial sequence and 92% identical and 94% similar to the murine sequence. 
Northern blots probing human mRNA samples using the gene sequence showed 
ubiquitous transcription of the gene in all human tissues tested. 

[0385] Using in vitro transcription and translation, the putative gene was shown 

to encode a protein of the expected molecular weight. The same gene was used to 
generate AcCMP-SAS, and this virus was used to infect Sf-9 cells. Pulse labeling 
followed by SDS-PAGE separation showed a protein of 49 kDa 3 consistent with the 
predicted weight of 48.5 kDa, that was not observed in the control infection (A3 5). 

CMP-Sialic Acid Synthase Activity. 
[0386] The function of the putative gene product was determined in vitro using 

cell lysates incubated with the precursors iV-acetylneuraminic acid and cytidine 
triphosphate based on the published thiobarbituric acid (TBA) assay 17 . CMP-SA 
synthetic ability is observed only in Sf-9 cells following AcCMP-SAS infection and 
was not detectable in control infected or uninfected cells. Furthermore, infection with 
the AcCMP-SAS provides 60 times the activity of the native CMP-SA synthetic ability 
ofCHO cells. 

Enzyme Localization. 
[0387] CMP-sialic acid synthetic activity in mammals has been shown to 

localize to the nucleus 13 , and fluorescent labeling of the recombinant murine enzyme 
expressed in 3T3 cells confirmed this localization 16 . Analysis of the human CMP-SAS 
sequence with the pSORT algorithm also predicted the protein would localize to the 
nucleus because of the nuclear localization sequence (KRPR) starting at amino acid 
199. To determine if the recombinant human enzyme localized to the nucleus of insect 
cells, a nuclear separation using the Pierce NE-PER™ kit was performed. A protein 
band from the AcCMP-SAS infection was found predominantly in the nuclear fraction 
but not in the nuclear fraction of a control infection. In contrast, recombinant SAS 
localized in the cytoplasmic fraction and not the nuclear fraction as observed previously 
for the native enzyme in mammalian cells 18 . TBA assays of the nuclear fraction 
showed CMP-SAS activity of AcCMP-SAS infected but not AcSAS infected cells. 
The protein band in the nuclear fraction following a detergent separation 19 was also 
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transferred to a PDF membrane for Af-terminal protein sequencing. The sequence was 
identical to amino acids 29 through 33 as predicted from the gene sequence indicating 
cleavage of the first 28 amino acids of the nuclear localized protein. 

Completion of the CMP-Sialic Acid Pathway in Insect Cells. 
[0388] The production of NeuSAc in insect cells has been accomplished by 

infection with the recombinant human sialic acid phosphate synthase virus (AcSAS) in 
concert with feeding of the precursor ManNAc 9 . Coinfection of AcSAS and AcCMP- 
SAS with concomitant ManNAc feeding was attempted in order to produce and activate 
sialic acids in insect cells. Proteins of the expected molecular weights and cellular 
locations are seen in lysates of cells infected with each virus individually and together. 
[0389] A procedure was developed to measure CMP-Neu5 Ac by high 

performance anion exchange chromatography (HPAEC) using UV monitoring. Using 
this technique, samples from different infection strategies were examined, and the 
levels of CMP-Neu5Ac produced on a protein basis are listed in Table 5, below. In 
agreement with previous work 10 , CHO cells are shown to have measurable CMP- 
NeuSAc levels which are augmented upon feeding of 10 mM ManNAc. Uninfected Sf- 
9 cells grown in serum-free medium do not have detectable CMP-Neu5Ac levels with 
or without ManNAc feeding. Large CMP-Neu5 Ac levels are produced in insect cells 
only when all three components of the sialic acid pathway are present: the precursor 
sugar ManNAc, the sialic acid phosphate synthase from AcSAS, and the CMP-sialic 
acid synthase from AcCMP-SAS. In this case, CMP-Neu5 Ac levels are approximately 
6 times those seen in CHO with ManNAc feeding and 30 times without ManNAc 
feeding. 

[0390] When ManNAc feeding is omitted from an AcSAS and AcCMP-SAS 

infection, a CMP-Neu5Ac peak is not observed but a peak eluting at 8.5 min. is 
detected. This peak has the same elution time as authentic CMP-KDN and is found in 
quantities of 4 pmol/jag protein. Further confirmation of the peak as CMP-KDN was 
obtained by acid hydrolysis of the eluted peak followed by fluorescent labeling with 
DMB and HPLC separation. The resulting fluorescent compound eluted at the same 
elution time as authentic KDN (data not shown). 
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Table 5. Intracellular CMP-Neu5 Ac levels of CHO and Sf-9 cells using the indicated 
feeding strategy and infection scheme. The CMP-Neu5Ac content of cell lysates was 
determined by HPAEC. 



Sample 


CMP-Neu5 Ac pmol/ng protein) 






CHO 


0.30 


CHO + ManNAc 


1.77 






Sf-9 


<0.02 


Sf-9 + ManNAc 


<0.02 






Sf-9 + A35 + ManNAc 


0.02 


Sf-9 + AcSAS + ManNAc 


0.02 


Sf-9 + AcCMP-SAS + ManNAc 


0.10 






Sf-9 + A35 + AcSAS 


<0.02 


Sf-9 + A35 + AcCMP-SAS 


<0.02 


Sf-9 + AcSAS + AcCMP-SAS 


<0.02 






Sf-9 + A3 5 + AcSAS + ManNAc 


0.08 


Sf-9 + A35 + AcCMP-SAS + ManNAc 


0.04 






Sf-9 + AcCMP-SAS + AcSAS + ManNAc 


10.30 



Discussion of Example 7 
[0391] The baculovirus expression system offers several advantages including 

the capacity to generate significant levels of post-translationally modified recombinant 
proteins in a eucaryotic cell line. However, one of the drawbacks of recombinant 
baculovirus expression is its inability to produce glycoproteins with sizable fractions 
having complex-type Af-glycans, especially those terminating in sialic acid residues. 
Sialic acids have been associated with numerous in vivo biological processes including 
development 20 , pathogen infectivity 21 , ligand-receptor interactions 21 , cancer 
metastasis 22 , and the elimination of non-sialylated glycoproteins by the 
asialoglycoprotein receptor 7 . 

[0392] Although Sf-9 cells do not contain detectable native pools of CMP- 

Neu5 Ac, this bottleneck to sialylation has been overcome by substrate feeding and 
engineering the CMP-SA synthesis genes into insect cells. The human CMP-sialic acid 
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synthase gene has been identified and expressed in an active form in insect cells. The 
enzyme localizes to the nucleus of insect cells as observed in the native mammalian 
host to suggest that the encoded nuclear localization signals are functional in insect 
cells as well. Moreover, the protein isolated from the nuclear fraction retains enzyme 
activity, but interestingly its first 28 amino acids are truncated as determined by N- 
terminal protein sequencing analysis. This cleavage may explain the lower molecular 
weight of the protein band observed in the nuclear fraction of the Coomassie stained gel 
as compared to the radiolabeled protein in the total cell lysate. The cause of the 
cleavage of the nuclear localized CMP-SA synthase is unknown as is the relevance of 
this cleavage to the nuclear localization process in insect cells. 
[0393] By feeding ManNAc along with expression of CMP-SAS and SAS, 

CMP-Neu5Ac levels more than five times higher than those observed in ManNAc 
supplemented CHO cells are achieved in Sf-9 cells. Some low level endogenous sialic 
acid metabolic activity may be present in insect cells as seen with the addition of 
ManNAc and just one component of the pathway (Table 5). However, CMP-levels are 
increased more than 100- fold by the addition of all three components. In order to 
complete sialylation, the CMP-Neu5 Ac must be transported into the Golgi where a 
sialyltransferase can transfer the Neu5Ac to the target acceptor oligosaccharide. 
Expression of recombinant sialyltransferases and a CMP-sialic acid transporter may 
also be required to achieve sialylation in insect cell culture. 

[0394] Interestingly, an alternative nucleotide sugar, CMP-KDN, is produced in 

insect cells co-expressing CMP-SAS and SAS without ManNAc feeding. Previously, 
we observed the production of KDN when insect cells expressed SAS in the absence of 
ManNAc. Therefore, both SAS and CMP-SAS must recognize the substrates for 
generating deaminated sialic acid forms. More significant is the observation that the 
CMP-KDN is generated in the absence of detectable levels of CMP-Neu5Ac. This is 
the first report of a cellular system that can produce substantial levels of activated KDN 
without activated Neu5Ac. Since at least one sialyltransferase recognizes both CMP- 
KDN as well as CMP-Neu5Ac 23 , this expression system may eventually allow the 
exclusive KDNylation of glycoproteins. KDN was first identified in 1986 24 , and its 
effects on function and activity of a sialylated glycoprotein are largely unknown. 
However, elevated KDN:Neu5Ac levels have been observed in tissues during 
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biological phenomena of great interest such as cancer and development . In addition 
glycoproteins sialylated by KDN rather than Neu5 Ac may be less susceptible to 
sialidases 24 . Such proteins could have longer in vivo circulatory half-lives and 
potentially greater pharmaceutical value. 

[0395] Furthermore, the capacity to enhance sialylation by engineering the 

CMP-SA pathway may have significance beyond the baculovirus expression system. 
Reports have suggested that limitations in levels of activated sialic acids may exist in 
mammalian cell lines as well 26 . Other expression systems such as yeast and plant cells 
may also lack some of the genes involved in generating CMP-SA so the pathway may 
be engineered into other recombinant hosts. In these cases, the sialyltransferases and 
the enzymes involved in sugar nucleotide transport may also be needed. Thus, in 
accordance with the invention, the number of cellular species that can generate complex 
"mammalian-like" glycoforms during recombinant protein expression may be increased 
through the metabolic engineering of the CMP-SA synthetic and transfer pathways. 

Conclusion of Example 7 

[0396] In summary, to increase substrate levels, the enzymes of the metabolic 

pathway for CMP-SA synthesis have been engineered into insect cells using the 
baculovirus expression system. A novel human CMP-sialic acid synthase gene (Cmp- 
Sas) was identified based on homology to the corresponding E. coli gene. The enzyme 
is ubiquitously expressed in human cells and is active in baculovirus-infected insect 
cells where it localizes to the nucleus as it does in mammalian cells. Co-expression of 
Cmp-Sas with the novel sialic acid phosphate synthase (SAS) of the present invention in 
concert with Af-acetylmannosamine (ManNAc) feeding yields intracellular CMP- 
NeuSAc levels 30 times higher than those observed in unsupplemented CHO cells. The 
absence of any one of these three components abolishes CMP-Neu5Ac production in 
vivo. However, when ManNAc feeding is omitted, the sugar nucleotide form of 
deaminated Neu5 Ac, CMP-2-kQto-3-dcoxy-D-glycero-B-galacto-nonomc acid (CMP- 
KDN), is produced instead indicating that alternative sialic acid glycoforms may 
eventually be possible in insect cells. Engineering the CMP-SA metabolic pathway 
may be beneficial in various cell lines in which CMP-Neu5 Ac production limits 
sialylation of glycoproteins or other glycans. 
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[0397] The entire disclosure of all publications (including patents, patent 
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[0398] Further, the Sequence Listing submitted herewith is hereby incorporated 

by reference in its entirety. Additionally, the specification and sequence listings of U.S. 
Provisional Applications Nos. 60/227,579; 60/169,624; and 60/122,582 and of U.S. 
Application No. 09/516,793 are all hereby incorporated by reference in their entireties. 



